Sagar Das | Data/ML Engineer

About Me

Background & Education

I hold a Master of Information Management from the University of Maryland, where I specialized in data engineering and machine learning. My academic journey equipped me with a strong foundation in computer science principles, statistical analysis, and modern data architectures.

My professional path has been driven by a passion for solving complex data challenges at scale. I've worked with Fortune 500 companies, academic institutions, and innovative startups, consistently delivering solutions that transform raw data into actionable insights.

What I Bring to the Table

Technical Excellence

• Scalable Architecture: Designed systems handling 20M+ daily events
• Performance Optimization: Reduced pipeline runtimes from 6 hours to 45 minutes
• AI/ML Integration: Built RAG systems processing 100K+ documents
• Cloud Expertise: Multi-cloud deployments on AWS, GCP, and Azure

Leadership & Impact

• Cross-functional Leadership: Led teams delivering enterprise data platforms
• Innovation Driver: Pioneered AI adoption, reducing analysis time by 60%
• Mentorship: Guided junior engineers in modern data practices
• Business Value: Delivered solutions serving 6 Fortune 500 clients

Professional Journey

Experience

Innovation & Research

University of Maryland

Data Specialist

Sep 2023 – May 2025

College Park, MD

Applying cutting-edge data engineering and ML methods to solve real-world data challenges at scale.

Platform Engineering

Tiger Analytics

Senior Software Engineer - Data Platform

Jul 2021 – Jul 2023

Chennai, India

Led the effort to build a self-serve data fabric on AWS and GCP, used by 6 Fortune 500 clients to streamline enterprise data operations and analytics.

Data Engineering

Xenonstack Pvt. Limited

Intern & Software Engineer

Jan 2019 – Nov 2019

Chandigarh, India

Built the technical foundation that shaped my career in data engineering and MLOps.

Featured Projects

Data Fusion Engineering

Google Cloud, Dataflow, BigQuery, Python

Comprehensive data pipeline solution for processing and analyzing large-scale datasets using Google Cloud Platform services.

Intelligent Record Management

Python, NLP, Elasticsearch, FastAPI

AI-powered document processing system with semantic search capabilities for efficient information retrieval.

Loan Default Prediction System

Python, Scikit-learn, Pandas, MLflow

Machine learning model to predict loan defaults using historical data and advanced feature engineering.

Data Prep for Fintech Analytics

Apache Spark, AWS, Python, Airflow

ETL pipeline for processing financial transaction data and generating actionable insights.

Monitoring EKS Cluster

Kubernetes, Prometheus, Grafana, AWS

Comprehensive monitoring solution for Kubernetes clusters with alerting and visualization.

Sports Analytics System

Python, Pandas, Tableau, Statistics

Data analysis platform for sports performance metrics and predictive modeling.

Tools & Technologies

Python

SQL

Scala

Bash

FastAPI

LangChain

dbt

Great Expectations

Pytest

Apache Spark

Apache Beam

Kafka

Airflow

Apache Iceberg

Airbyte

Terraform

Docker

Deequ

Redshift

BigQuery

Postgres

MySQL

DynamoDB

DuckDB

Neo4j

Elasticsearch

FAISS

BERT

AWS Bedrock

RAG Systems

Knowledge Graphs

Transformers

LLM APIs

Methods & Concepts

Core Engineering

Data Structures & Algorithms

Distributed Systems

MLOps

CI/CD Pipelines

GitOps

REST APIs

Backend Development

Server Side Programming

Data Architecture

Data Lakehouse Architecture

Data Mesh

Data Governance

ETL/ELT Pipelines

Real-time Data Processing

Data Warehousing

Data Cataloging

Streaming Analytics

AI & ML

RAG Systems

Vector Databases

Fine-tuning

Prompt Engineering

Model Deployment & Monitoring

Feature Engineering

Automated Machine Learning

Explainable AI

Advanced Systems

Event-Driven Architecture

Data Observability

Data Security

Federated Learning

Edge Computing

IoT Data Integration

Data Quality

Metadata Management

Academic Journey

Education

Building a strong foundation through academic excellence and continuous learning in technology and data science.

Master of Information Management

University of Maryland

2023 – 2025

College Park, MD

GPA: 3.97/4.0

Key Achievements

• Specialization: Specialized in Data Science and Machine Learning
• Achievement: Received a full tuition scholarship for the entire course duration
• Activities: Participated in intramural sports and data science club activities

Bachelor of Engineering in Information Technology

Panjab University Chandigarh

2015 – 2019

Chandigarh, India

GPA: 7.72/10.0

Key Achievements

• Research: Thesis on classifying Wireless Sensor Networks using ML algorithms
• Leadership: Member of the Panjab University Entrepreneurship Development Cell
• Sports: Participated in inter-university football championships

Let's Talk About Your Data Needs

Whether you're looking to build a data platform, optimize existing pipelines, or explore how AI/ML can enhance your data strategy, I'd love to hear from you.

Phone

+1 (240) 495-9874

Email

sagardas.work@gmail.com

LinkedIn

linkedin.com/in/sagardas08

GitHub

github.com/sagar8080

Available for consulting, full-time opportunities, and collaborations