Data & AI Platform Engineer

Aditya Kumar

// Building Modern Data & AI Platforms

Designing scalable data pipelines, lakehouse architectures, and AI-powered workflows using Databricks, Spark, Python, SQL, and Cloud technologies.

Explore Projects · Architecture Notes · Resume

platform_architecture.py

Databricks

Apache Spark

Kafka

Airflow

dbt

Snowflake

AWS

Azure

Docker

PostgreSQL

Delta Lake

Python

PySpark

SQL

MLflow

Terraform

What I Build

Engineering Capabilities

Data Engineering

Building robust pipelines that transform raw data into reliable, queryable assets at scale.

Batch ETL · Streaming · Orchestration · Data Quality

Source → ETL → Warehouse

Analytics Platforms

Designing lakehouse-native data models that power business metrics and executive dashboards.

Warehousing · Data Modeling · Metrics · BI

Lakehouse → Warehouse → BI

AI Platforms

Operationalizing machine learning from feature engineering to production model serving.

Feature Pipelines · ML Workflows · Model Deploy · MLOps

Data → Features → Model → API

Engineering Journey

The Evolution

2023

Python Foundations

Built strong algorithmic thinking and Pythonic programming habits, covering OOP, data structures, automation scripting, and API consumption.

2024

Machine Learning & AI

Explored supervised and unsupervised learning. Built and evaluated models using the scikit-learn, pandas, and NumPy ecosystem.

2025

Cloud Computing & Data Science

Deployed workloads on AWS and Azure. Covered cloud fundamentals, storage tiers, compute services, and data-at-scale patterns.

2026 — Now

Data Engineering & Databricks

Deep-diving into Spark, Delta Lake, and the Databricks lakehouse paradigm. Building production-grade ETL and streaming systems.

Future

Data & AI Platform Engineering

Converging data engineering, analytics, and AI workloads into unified platforms, with a focus on governance, observability, and intelligent systems.

Featured Work

Architecture Designs

Lakehouse Architecture

Production

medallion pattern

Bronze: Raw Ingestion (Delta)

Silver: Cleansed & Validated (Spark)

Gold: Business Ready (SQL)
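
The bronze, silver, and gold layers listed above can be illustrated with a minimal sketch. It uses plain Python lists in place of Delta tables and Spark jobs, so it shows only the shape of the pattern, not a production implementation. The sample records, field names, and the functions `to_bronze`, `to_silver`, and `to_gold` are hypothetical and chosen for illustration.

```python
from collections import defaultdict

# Hypothetical raw events; field names are illustrative assumptions.
RAW_EVENTS = [
    {"order_id": "1", "region": "north", "amount": "10.50"},
    {"order_id": "2", "region": "south", "amount": "4.00"},
    {"order_id": "2", "region": "south", "amount": "4.00"},  # duplicate record
    {"order_id": "3", "region": "north", "amount": "oops"},  # invalid amount
]


def to_bronze(events):
    """Bronze: keep every record as received, adding only ingestion metadata."""
    return [{**event, "_source": "demo"} for event in events]


def to_silver(bronze):
    """Silver: cast types, drop invalid rows, and deduplicate by order_id."""
    seen, clean = set(), []
    for row in bronze:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        if row["order_id"] in seen:
            continue  # skip duplicates of an already accepted order
        seen.add(row["order_id"])
        clean.append(
            {"order_id": row["order_id"], "region": row["region"], "amount": amount}
        )
    return clean


def to_gold(silver):
    """Gold: aggregate cleansed rows into a business-ready metric per region."""
    totals = defaultdict(float)
    for row in silver:
        totals[row["region"]] += row["amount"]
    return dict(totals)


gold = to_gold(to_silver(to_bronze(RAW_EVENTS)))
print(gold)
```

In a real lakehouse each function would more likely read from and write to its own table, so that every layer can be reprocessed from the one before it.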

Real-Time Analytics Platform

Streaming

event-driven pipeline

Event Stream (Kafka)

Stream Processing (Spark)

Lakehouse Store (Databricks)

Live Dashboard (BI Tool)

AI Workflow Platform

AI/ML

ml lifecycle

Curated Data (Delta)

Feature Store (Databricks)

Model Training & Registry (MLflow)

Deployment API (REST)

Modern ETL Platform

ETL

batch orchestration

Workflow Scheduler (Airflow)

Distributed Transform (Spark)

ACID Storage (Delta Lake)

Analytics Layer (SQL)

Knowledge Base

Engineering Notes

Understanding Delta Lake

ACID · Time Travel · Schema Evolution

Spark Transformations Explained

Lazy Evaluation · DAG Optimization

Kafka for Data Engineers

Topics · Partitions · Consumer Groups

Airflow DAG Design Patterns

Idempotency · Backfill · SLAs

Data Modeling Fundamentals

Star Schema · SCD · Normalization

Databricks Lakehouse Architecture

Unity Catalog · Photon · Serverless

Data Engineering Interview Notes

System Design · SQL · Pipeline Questions

More coming soon

Updated regularly

Vision

// mission_statement.txt

My goal is to build expertise in modern Data & AI platforms that unify data engineering, analytics, machine learning, and deployment.

I am particularly interested in how platforms like Databricks are converging traditional data engineering with AI workloads to create scalable, intelligent systems.

Opportunities

Open To

Data Engineering Internships

Databricks & Spark Projects

Open Source Contributions

Technical Discussions

Engineering Collaborations

Let's Connect

Whether you're working on a data platform, want to collaborate on an open-source project, or just want to talk shop, reach out.

LinkedIn: linkedin.com/in/aditya-kr86

GitHub: github.com/aditya-kr86

Email: hi@ankus.dev