How does this work?
The Amazon Redshift Data Governance accelerator helps you govern Amazon Redshift clusters. Amazon Redshift exposes system tables and views that record metadata about cluster activity, and the accelerator uses them to build a governance framework. This helps users avoid turning the data warehouse into a data swamp and govern it efficiently.
Features

Automate data governance in an Amazon Redshift cluster by scheduling queries in the Redshift console or through the Redshift Data API.

Capture relevant cluster metadata (for example, the top 50 queries run and the resources consumed).

Store the captured data in a view in Amazon Redshift.
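
The three features above can be sketched in a few lines of Python. This is a minimal illustration, not the accelerator's actual implementation: the schema name `governance`, the view name, the statement name, and the cluster, database, and user identifiers are all assumptions made for the example. It builds a query over the `STL_QUERY` system table, wraps it in a view, and prepares the request for the Redshift Data API `execute_statement` call in boto3. Scheduling itself (the console's query scheduler or any other scheduler) is not shown.

```python
"""Sketch: expose the 'top 50 time-consuming queries' as a view via the Redshift Data API."""

# Assumed names for illustration only; they do not come from the accelerator.
GOVERNANCE_SCHEMA = "governance"
TOP_QUERIES_VIEW = "top_50_time_consuming_queries"


def top_queries_sql(limit: int = 50) -> str:
    """Return a SELECT over the STL_QUERY system table, longest-running first."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    return (
        "SELECT userid, query, TRIM(querytxt) AS query_text, starttime, endtime, "
        "DATEDIFF(millisecond, starttime, endtime) AS elapsed_ms "
        "FROM stl_query "
        "ORDER BY elapsed_ms DESC "
        f"LIMIT {int(limit)}"
    )


def create_view_sql(limit: int = 50) -> str:
    """Wrap the SELECT in a view so the captured metadata can be queried by name."""
    return (
        f"CREATE OR REPLACE VIEW {GOVERNANCE_SCHEMA}.{TOP_QUERIES_VIEW} AS "
        + top_queries_sql(limit)
    )


def build_request(cluster_id: str, database: str, db_user: str, limit: int = 50) -> dict:
    """Assemble keyword arguments for the Data API execute_statement call."""
    return {
        "ClusterIdentifier": cluster_id,
        "Database": database,
        "DbUser": db_user,
        "Sql": create_view_sql(limit),
        "StatementName": "governance-top-queries",  # assumed label
    }


def submit(request: dict) -> str:
    """Send the statement to Redshift and return the statement Id.

    Requires boto3 and AWS credentials; imported lazily so the builders
    above can be used and tested without them.
    """
    import boto3

    client = boto3.client("redshift-data")
    response = client.execute_statement(**request)
    return response["Id"]


if __name__ == "__main__":
    # Placeholder identifiers; replace with real values before calling submit().
    request = build_request("my-cluster", "dev", "governance_user")
    print(request["Sql"])
```

Keeping the SQL builders separate from `submit` means the generated statement can be reviewed or pasted into the Redshift console scheduler without any AWS calls being made.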
Out-of-the-box Metrics
- Queries hitting Redshift tables
- Redshift schema usage & table scan history by user
- Redshift load errors
- COPY command
- Table insertion
- Redshift session details
- Top 50 time-consuming queries
- Unload files & tables
- Libraries in Redshift
- External tables and schemas in cluster
- Access
- Stored procedure
- Spectrum scan errors
- User-wise access roles
- Vacuum percentage
- Storage tracking
- Vacuum recommendation details
- User permissions at database and schema level
- Alert tracking
- Unused tables
- Resource usage
- Locked tables
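
To make the metric list more concrete, the sketch below maps a handful of the metrics to Redshift system tables and views that hold the relevant data. The mapping, the metric keys, the `governance` schema name, and the 10% unsorted threshold are assumptions made for illustration; the accelerator's own queries are not shown on this page and may differ.

```python
"""Sketch: hypothetical mapping from some listed metrics to Redshift system tables."""

# Assumed schema that would hold the governance views.
GOVERNANCE_SCHEMA = "governance"

# Metric key -> SELECT over a Redshift system table or view (illustrative only).
METRIC_QUERIES = {
    "redshift_load_errors": (
        "SELECT starttime, TRIM(filename) AS filename, line_number, "
        "TRIM(colname) AS colname, err_code, TRIM(err_reason) AS err_reason "
        "FROM stl_load_errors"
    ),
    "session_details": (
        "SELECT starttime, process, TRIM(user_name) AS user_name, "
        "TRIM(db_name) AS db_name FROM stv_sessions"
    ),
    "locked_tables": (
        "SELECT table_id, lock_owner, lock_owner_pid, lock_status FROM stv_locks"
    ),
    "storage_tracking": (
        'SELECT "schema", "table", size AS size_mb, pct_used, tbl_rows '
        "FROM svv_table_info"
    ),
    "vacuum_recommendation": (
        # The threshold of 10 (percent unsorted) is an arbitrary example value.
        'SELECT "schema", "table", unsorted, stats_off '
        "FROM svv_table_info WHERE unsorted > 10"
    ),
    "alert_tracking": (
        "SELECT event_time, query, TRIM(event) AS event, TRIM(solution) AS solution "
        "FROM stl_alert_event_log"
    ),
    "external_tables": (
        "SELECT schemaname, tablename, location FROM svv_external_tables"
    ),
}


def metric_sql(name: str) -> str:
    """Return the SELECT for a metric, or raise KeyError listing the known keys."""
    try:
        return METRIC_QUERIES[name]
    except KeyError:
        known = ", ".join(sorted(METRIC_QUERIES))
        raise KeyError(f"unknown metric {name!r}; known metrics: {known}") from None


def view_ddl(name: str, schema: str = GOVERNANCE_SCHEMA) -> str:
    """Return a CREATE OR REPLACE VIEW statement that exposes one metric."""
    return f"CREATE OR REPLACE VIEW {schema}.{name} AS {metric_sql(name)}"


if __name__ == "__main__":
    for metric in sorted(METRIC_QUERIES):
        print(view_ddl(metric))
```

Each generated statement could be run once to create the view, after which dashboards or scheduled queries read from the view rather than from the system tables directly.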
Benefits of this Accelerator
- Gain more insight into cluster activity through visualisation
- Improve Amazon Redshift performance using Redshift usage data
- Get information on access and usage for each user
- Track storage for each table through visuals
Reference Architecture

Sample Visualization
