Synthetic Test Data Generator (STDG)
GenAI · Data Platform
LLM-powered synthetic data generation using Snowflake metadata to replace production data in dev/QA.
- Reduced test-data preparation effort by 70–80%.
- Enabled compliant rollout across initial 10 global markets.
SnowflakeLLMsAWS
Automated Data Classifier
Governance · PII
Column-level classification for Snowflake using LLM-driven metadata intelligence with explainable reasoning.
- Tagged 100% of Snowflake columns (PII detection + exports).
- Natural-language controls for analysts and data owners.
AtlanSnowflakePIIRAG
AiDE (AI-Driven Engineering)
LLM Platform
Enterprise LLM platform powering code copilots and NL→SQL analytics, optimized for cost and latency.
- Instruction tuning (3B–40B) using DeepSpeed/FSDP + LoRA/QLoRA.
- NL→SQL accuracy improved by 30%; deployed to 3 clients with $500K+ yearly savings.
- High-throughput inference via vLLM (TensorRT) and TGI for 250 concurrent users.
DeepSpeedvLLMTensorRTPostgres
Claims Automation (Smart Virtual Adjuster)
Insurance
End-to-end ML pipeline for claims triage and automation.
- Trained XGBoost on 400k historical claims; achieved 87% accuracy/recall.
- Reduced manual workload by 40% via AWS SageMaker pipeline.
AWS SageMakerXGBoostSnowflake
Fraud Monitoring Solution
Anomaly Detection
Procurement anomaly detection across 10 years of contracting data.
- Flagged $10B worth of tenders for investigation.
- Azure Databricks pipelines + Power BI dashboards.
Azure DatabricksiForestPower BI
Document Intelligence APIs
NLP · OCR
OCR + extraction systems for identity, invoice, and legal documents with production APIs.
- Identity docs extraction at 97% accuracy; invoice/legal at 92%.
- Built 20+ REST APIs using Django/FastAPI, Dockerized on AWS.
FastAPIDjangoELECTRABERT-QA