Data Profiling
Understand your data at a glance — distributions, patterns, anomalies, and health metrics, automatically computed.
Know Your Data Before You Use It
Profiling gives your team the confidence to trust, query, and build on datasets they discover in the catalog.
Column-Level Statistics
Every profiled column gets a statistical summary — null rate, unique count, min/max values, most frequent values, and distribution histograms. For numeric columns, you also get mean, median, standard deviation, and percentiles. All computed directly on your warehouse.
- Null rate and completeness percentage
- Unique value count and cardinality ratio
- Min, max, mean, median, and std deviation
- Value distribution histograms
Volume & Pattern Anomalies
Profiling tracks row counts, schema changes, and statistical patterns over time. When a table's volume drops unexpectedly, a column's null rate spikes, or a new column appears, Qarion flags the anomaly and creates an alert.
- Row count trend monitoring
- Null rate spike detection
- Schema change tracking (new/dropped columns)
- Distribution drift alerts
Table-Level Health Score
Each table gets a composite health score based on completeness, freshness, schema stability, and documentation coverage. The score aggregates into data product and space-level health views, giving leadership visibility into data reliability.
- Completeness, freshness, stability metrics
- Aggregated health at product and space level
- Historical trend visualization
- Health-based data prioritization
Data Classification
Profiling automatically detects potential PII, email addresses, phone numbers, credit card patterns, and other sensitive data based on column names and sample values. Classification results feed into governance rules and compliance workflows.
- PII detection (name, email, phone, SSN patterns)
- Credit card and financial data detection
- Custom classification rules
- Feeds into compliance and access controls
Profile Your Data in Minutes
Connect a data source and let Qarion's profiling engine give you immediate data intelligence.