Platform Features

Three Engines, One Intelligent Data Platform

Coomia integrates Flink-native pipelines, a Palantir-grade ontology, and an AI agent super-team into a unified, privately deployed platform — from raw data to actionable intelligence, without the Airbyte + dbt + Dagster tool-chain sprawl.

Flink-native Pipelines

AI-driven pipeline authoring on Apache Flink with full-lifecycle governance — from one-click database onboarding to production-grade batch + streaming, real lineage, and contract-first quality enforcement.

Pipeline Builder— orders_realtime · draft
coomia.app
Sources
mysql.prod_erpCDC
orders
order_items
customers
products
postgres.crmCDC
kafka.events
Auto-detected
4 entity types · 12 relations
Prompt › Build a realtime orders pipeline joining customers, partitioned by day, with medallion layering.
Generate
Flink SQLbatch + streaming✓ validated
CREATE TABLE dwd_orders_enriched (
  order_id STRING,
  customer_id STRING,
  amount   DECIMAL(18,2),
  dt       DATE,
  PRIMARY KEY (order_id) NOT ENFORCED
) WITH (
  'connector' = 'doris',
  'table.identifier' = 'dwd.orders_enriched'
);

INSERT INTO dwd_orders_enriched
SELECT o.id, o.customer_id, o.amount, DATE(o.created_at)
FROM bronze_orders o
LEFT JOIN bronze_customers c
  ON o.customer_id = c.id;
Contract
order_idSTRING
customer_idSTRINGPII
amountDECIMAL
dtDATE
Lineage · 3 hopsfresh
bronze_orders
dwd_orders
dws_daily
Flink cluster readyDoris proj_onto_paasmainEst. cost: $0.12/hr
Lineage— dws.daily_revenue · column-level
coomia.app
BronzeSilverGold✓ no drift
bronze_ordersidcustomer_idamountstatuscreated_atdwd_orders_enrichedorder_idcustomer_idamount_usdis_paiddtdws_daily_revenuedtrevenueorders
Source: real Flink job + Doris query planSync'd 12s ago

AI-Powered Pipeline Builder

Describe your data flow in natural language. Coomia generates production-ready Flink SQL plus Doris DDL with Bronze → Silver → Gold layering — unified batch + streaming, in minutes, not days.

One-Click Database Onboarding

Point Coomia at a MySQL, PostgreSQL, Doris, or ClickHouse database. It scans the schema, detects relationships, drafts entity types, and deploys CDC pipelines — all in a single flow.

Data Quality Engine

AI auto-generates quality rules with 30+ rule types via Great Expectations. Statistical profiling, anomaly detection, PII identification, and real-time quality scoring.

Real Data Lineage

True lineage sourced from Flink jobs and Doris queries — not mocked. Column-level tracking with visual impact analysis before any change ships.

Data Contracts & Schema Drift

Contract-first code generation ensures Schema precedes code. Automatic schema drift detection, breaking change alerts, and AI-generated repair proposals.

Asset Catalog & Profiling

URN-unified asset catalog with multi-source sync across Flink, Doris, and the ontology layer. Column-level profiling with distribution histograms, null rates, and PII detection.

Data Service Publishing

One-click conversion of SELECT queries to REST APIs with dynamic parameters, version management, rate limiting, caching, and access control.

Pipeline Template Marketplace

Pre-built templates for common industry scenarios (ERP, CRM, e-commerce). Optimization engine auto-scans for performance bottlenecks and suggests improvements.

Ontology Intelligence

Palantir Foundry-grade ontology system — 16 modules covering object types, actions, rules, decisions, worlds, vector search, matching functions, and AI-augmented generation.

Ontology Explorer— Customer · proj_onto_paas
coomia.app
Object Types16
Customer128,412
Order2.1M
Product18,204
OrderPlacedE2.1M
PaymentFailedE3,208
Shipment1.8M
World Branch
main → scenario-q2
placedcontainsships_tolives_attriggersfromCustomerOrderProductShipmentOrderPlacedPaymentFailed
Graph · 6 types · 11 relations
ObjectEvent
Customerv3.1
id
STRING · PK
emailPII
STRING
lifetime_valuederived
DECIMAL
risk_scorevector
FLOAT
tier
ENUM
Actions
UpdateTier
FlagForReview

Object Type Management

Define ObjectType schemas with attributes, relations, and rules. Stored in Doris VARIANT JSON with full version control. Changes propagate automatically downstream.

Object Explorer

Instance CRUD, semantic search, filtering, relationship graph visualization, and timeline view. Navigate your domain model interactively with full context.

Action Engine

5 pure data operations (CREATE/MODIFY/DELETE OBJECT/LINK) with transaction support. Atomic execution ensures data consistency across complex multi-step actions.

Rule Engine & Automation

YAML-based business rules with Rete network reasoning. TRIGGER_DECISION triggers, derived attribute computation, and event → rule → recommendation → execution closed loop.

Decision Studio

Interactive scenario exploration with rule evaluation, option generation, impact assessment, and execution. AI-assisted recommendations with explainable confidence scores.

World Management & What-if

Branch scenario management with Nessie integration. Before/After split-screen comparison, impact propagation animation, and multi-scenario side-by-side evaluation.

Event-Driven Architecture

Kafka-powered real-time event processing with lifecycle transitions. Event → Rule → Recommendation → Confirmation → Execution full automation pipeline.

Agent Runtime

DecisionAgent, QueryAgent, and OntologyAssistant with 4 agent capability types. AI agents that understand your ontology and execute actions autonomously.

Vector Query & Graph Traversal

HNSW vector embedding search for semantic similarity. Multi-hop graph traversal and aggregation analysis across relationship networks.

6 Matching Functions

Vector matching, attribute matching, capability matching, team matching, risk matching, and temporal matching — find the right objects across any dimension.

Function Runtime

User-defined Python functions in sandboxed execution. Derived attribute computation with full isolation, versioning, and performance monitoring.

OAG: Ontology Augmented Generation

Structured RAG grounded in your ontology. Prompt templates, decision explanation generation, and citation tracking — AI answers backed by real business objects.

Analytics AI

Five specialized AI agents collaborate as a super-team — delivering NL insights, auto dashboards, root cause analysis, forecasting, causal analysis, and 5 advanced analysis models.

Dashboard Agent— Revenue Overview · auto-generated
coomia.app
Revenue Overview
AI-generatedOntology-aware
7d30dQTDYTD
GMV
$14.8M
+8.4%
New accts
812
+12%
Churn
1.2%
+0.3pp
NRR
118%
+4pp
Daily GMVlast 30 days
Funnel
Visit
24.1k
Signup
14.9k
Trial
9.2k
Paid
3.4k
By segment
Retention
Top regions
US-West
32%
EU-Central
24%
APAC
21%
US-East
14%
Analytics Copilot— What drove Q2 revenue drop?
coomia.app
RCA AgentAnalytics AgentForecastgrounded ✓
Revenue dropped −14.2% in Q2, driven primarily by the Enterprise segment (−23%). Root cause narrowed to 3 churned accounts via the Customer → ChurnEvent path.
Revenue
$8.42M
−14.2%
Active accts
2,104
−48
NRR
102%
−6pp
Monthly revenue · rolling 12Mvs. forecast
drop
JFMAMJJASOND
You
What drove the Q2 revenue drop?
RCA Agentstep 1/3
Scanning 14 anomaly candidates via Customer → ChurnEvent
Segment breakdown
Churned accounts
Timeline backtrack
Can you forecast recovery?
Ask anything grounded in your ontology…

Analytics Agent

Natural language → SQL with ontology-aware semantic understanding. Aggregation, distribution, TopN queries with dynamic charts, confidence scoring, and insight annotations.

Dashboard Agent

AI auto-generates interactive dashboards with intelligent chart type selection and layout optimization. Ontology-aware with object drill-through to Object Explorer.

DQ Agent

Automated data quality rule generation with 30+ rule types via Great Expectations integration. Quality reports, anomaly alerts, and trend monitoring.

RCA Agent

Root cause analysis with anomaly metric tracking and impact chain visualization. Causal analysis canvas with event backtracking and timeline replay.

Forecasting Agent

Time-series prediction with confidence interval estimation and long-term trend forecasting. Prediction curves with visual confidence bands.

6 Structured Renderers

Table, card, chart, metric, timeline, and form renderers. CopilotKit integration for real-time streaming answers and multi-turn conversation context.

Funnel & Retention Analysis

Multi-step conversion funnel with drop-off identification. Retention heatmaps, cohort comparison, and churn curves — powered by Doris WINDOW_FUNNEL & RETENTION.

Path & Interval Analysis

Event sequence visualization with Sankey flow diagrams (forward/reverse paths). Time-between-events distribution with box plots and percentile-based bottleneck identification.

Attribution Analysis

5 attribution models (first-touch, last-touch, linear, time-decay, position-based) for multi-touchpoint contribution measurement and channel effectiveness evaluation.

Pattern Discovery Engine

Vector clustering, graph pattern recognition, time-series trend identification, and cross-dimension correlation discovery. AI proactively surfaces insights you didn't know to look for.

Ontology-Grounded Copilot

Page context auto-injection, conversational action execution, and entity auto-linking. The copilot understands what you're looking at and acts accordingly.

Common Capabilities

Enterprise-grade infrastructure that powers every module.

Project Management

Multi-org, multi-project workspaces with role-based access (Admin/Developer/Viewer) and complete resource isolation.

Git-style Data Versioning

Nessie-powered branches for data. Zero-copy cloning, atomic merges, and Merge Request workflows with code + data diff.

RBAC Access Control

Fine-grained permissions at world, project, and field levels. SSO, SAML, OIDC support with data classification and masking.

Audit & Compliance

Complete operation history with immutable records. Query audit logs, access tracking, usage statistics, and compliance proof.

Multi-tenant Isolation

Three-level isolation: Org → Project → Branch. Data, compute, and access boundaries enforced at every layer.

Business Glossary

Term definitions, synonym management, asset association, and review workflows. Unified business vocabulary across the platform.

Ready to Get Started?

Deploy Coomia in your own infrastructure. Request a demo or start a 14-day free trial today.

Features — Three Engines, One Intelligent Data Platform · Coomia