Built Data Lake + Real-Time Analytics Stack
Designed and built an end-to-end data lake architecture on S3 with Apache Iceberg, coupled with a real-time analytics stack using StarRocks. This replaced a fragmented system of ad-hoc queries and slow batch jobs.
Key Results
- Real-time data freshness across all analytics
- Self-serve analytics for the entire org
- 90% reduction in data pipeline complexity