This article will go over how data elements are stored in Infogix Data360 DQ+ installations.
Data elements are stored in the following areas:
ApplicationDB: Postgres (Enterprise and Cloud editions)
- Definition configurations (Analysis, Data Store, etc.)
- Data View creation & loading
- Workflow management
- Application management information
- Audit trails, Exceptions, Process Status
- Attachments, Comments, Annotations links
- Execution History (Enterprise deployments, and Cloud edition DQ+ 4.3 and newer)
Compute Cluster: Hadoop (Enterprise and Cloud editions)
- Compute job submissions
- Data Prep, Validation, and Analysis job execution inputs and outputs
- Process/job logs
- Process Status Monitoring
ComputeDB: Vertica (Enterprise) or RedShift (Cloud)
- Contains data stored within DQ+ Data Views
- Processes queries made by DQ+ Dashboards
DynamoDB (Cloud edition prior to DQ+ 4.3)
- Execution History
Comments
1 comment
Also, in an Enterprise (customer on-premise) installation at least, internal data stores seem to be stored - or at least cached - in the customer's HDFS file system in the directory /user/sagacity/data/.
Please sign in to leave a comment.