Abstract dark blue digital nodes and connections network background
Available for new experience and projects

Turning raw data into actionable intelligence.

Hi, I'm Shashank. I specialize in building large-scale data pipelines to AI-driven decision systems, I bridge data science, machine learning, and real-world impact.

format_quote

"Data is the new oil. It's valuable, but if unrefined it cannot really be used. My job is to refine it."

Profile Photo
terminal

Current Status

Building next-gen LLMs

About Me

Turning data, models, and systems into real-world impact.

I am a Data Scientist and Machine Learning Engineer with 3+ years of experience building end-to-end data and AI systems that move beyond experimentation into production. My background spans applied statistics, scalable data engineering, and modern machine learning, with a strong focus on delivering measurable business outcomes.

I have designed and deployed forecasting, analytics, and automation pipelines using Snowflake, Azure, and cloud-native tooling, driving up to 60 percent efficiency gains and 25 percent accuracy improvements across large-scale, real-world datasets. I enjoy working at the intersection of data, engineering, and stakeholders to turn complex problems into reliable systems.

Currently, I specialize in Generative AI, LLMs, and MLOps, building high-performance inference engines, RAG-based applications, and OpenAI-compatible APIs with streaming, observability, and production-grade constraints. I believe strong models matter, but robust systems, scalability, and trust matter more.

Outside of work, I contribute to open-source projects, experiment with new AI architectures, and write about applied machine learning, system design, and responsible AI practices.

20+ Projects
2B+ Records Scale
5+ AI Systems Built

Featured Projects

Computer screen displaying code and graphs
LLM Inference vLLM Python

Production-Grade Video Summarization and Q&A Platform using Whisper, and Distributed Workers

Scalable asynchronous pipeline for video summarization and Q&A using Whisper, FastAPI, and Redis.

Computer screen displaying code and graphs
LLM Inference vLLM Python

High-Performance LLM Inference Engine with Grammar-Constrained Decoding

Built a local LLM inference engine implementing and enabling deterministic and grammar-constrained text generation through token masking and incremental parsing.

Dashboard with financial data charts
LLMs RAG LangChain

TutorMind - Personalized AI

AI-powered assistant that answers context-specific questions using user-uploaded documents, LLMs, and vector-based retrieval.

View All Projects arrow_forward

Technical Proficiency

Building production-grade data and AI systems across machine learning, large language models, data engineering, and cloud-native MLOps.

code

Languages

Python SQL R C++ MATLAB Javascript HTML CSS
psychology

Machine Learning & AI

Supervised/Unsupervised ML Deep Learning NLP LLMs RAG Vector Databases PyTorch TensorFlow LangChain vLLM
data_object

Data Engineering

ETL/ELT Data Pipelines Data Modeling Data Warehousing Apache Spark Apache Kafka Apache Airflow
settings_system_daydream

MLOps

Docker Kubernetes DagsHub MLflow CI/CD Git Distributed Computing
cloud

Cloud Platforms

AWS Azure Google Cloud (GCP) Snowflake Databricks
bar_chart

Analytics & Visualization

Streamlit Tableau Power BI SPSS Dashboard Design Business Metrics Looker Studio
database

Database Systems

MySQL PostgreSQL MongoDB BigQuery GraphDB Firestore Vector Store
science

Experimental Design

A/B Testing Hypothesis Testing Statistical Inference Casual Inference Experimental Design

Professional Experience

A track record of delivering high-impact solutions across various industries.

Data Scientist | Supply Chain | Demand Forecasting

Sep 2025 - Dec 2025
New Era Cap | Buffalo, United States

Built and deployed an end-to-end demand forecasting system using Snowflake and Python, translating complex supply chain data into scalable, production-ready forecasting solutions. Focused on accuracy, automation, and stakeholder usability across large SKU portfolios.

  • Designed and deployed a demand forecasting pipeline with feature engineering and LightGBM, serving forecasts via Azure-backed infrastructure and Streamlit dashboards for planners.
  • Automated vendor capacity and purchase buy planning through allocation logic and retraining pipelines, eliminating manual Excel workflows and improving forecast scalability and auditability.
  • Collaborated with supply chain, sourcing, and engineering teams to design data schemas, implement ETL pipelines, validate models, and deliver stakeholder-ready documentation.

Lead Data Analyst | Supply Chain | Last Mile

Dec 2023 - Jul 2024
Shadowfax | Bangalore, India

Led data-driven cost, productivity, and performance optimization initiatives across regional operations, partnering closely with operations and leadership teams to translate analytics into measurable business outcomes.

  • Achieved 98% accuracy in employee database cleansing and validation, establishing a new company-wide benchmark for data quality and reporting reliability.
  • Improved regional productivity by 60% through analytics-backed manpower allocation strategies and performance optimization across delivery hubs.
  • Designed and delivered hub-level performance dashboards, enabling leadership to identify bottlenecks, track KPIs, and drive data-informed operational decisions.

Software Engineer Intern | FinTech | EdTech

Jun 2022 - Jun 2023
Frugal Testing | Hyderabad, India

Contributed to enterprise-scale software delivery for ONELERN and NPCI, working across automation, testing, and deployment workflows in regulated and high-reliability environments. Gained strong foundations in system quality, performance, and production readiness.

  • Designed and implemented automation frameworks across four large-scale projects, reducing manual QA effort by 30 percent and improving release reliability.
  • Collaborated with cross-functional teams on the NPCI Central Bank Digital Currency (CBDC) platform, supporting deployment workflows, performance monitoring, and production readiness.
  • Contributed to test strategy improvements through data-driven analysis, reducing testing cycle times and improving feedback loops across multiple enterprise systems.

Education

school
2024 - 2025

M.S. in Data Science

University at Buffalo (SUNY)

Specialization in Machine Learning and Artificial Intelligence. Interned at New Era Cap as an Data Scientist.
GPA: 3.9/4.0

school
2019 - 2023

BTech. in Computer Science and Engineering

Lovely Professional University

Engineering Major: Data Science.
GPA: 3.8/4.0

Achievements & Certifications

article

Data Analytics Essentials by Cisco

Completed with certification via Microsoft & LinkedIn Learning. (2023)

article

Python and SQL for Data Science

Earned certification in Python and SQL for Data Science through Scaler Academy. (2023)

article

Career Essentials in Data Analysis by Microsoft and LinkedIn Learning

Completed the data analysis track with certification from Microsoft and LinkedIn. (2023)

badge

Top 50 SQL By Leetcode

Earned a badge for solving the top 50 SQL interview challenges on LeetCode. (2023)

article

Advanced SQL Certificate by HackerRank

Cleared the Advanced SQL assessment and received HackerRank recognition. (2023)

lightbulb

MongoDB M001

Completed the M001 course and received certification from MongoDB University. (2022)

emoji_events

GFG DSA self paced course

Successfully completed and certified by GeeksforGeeks' for the DSA self-paced program. (2021)

sports_score

Codechef Snackdown 2021

Qualified for the CodeChef SnackDown Round-1A in a global programming event. (2021)

emoji_events

JUMPSTART Finale Contestant

PreFinalist among 150+ students in a national hackathon by Publicis Sapient. (2021)

emoji_events

Modern Big Data Analysis with SQL

Specialization Certificate in Big Data SQL (2021)
Course 1: Foundations for Big Data Analysis with SQL
Course 2: Analyzing Big Data with SQL
Course 3: Managing Big Data in Clusters and Cloud Storage.

article

Crash Course on Python by Google

Gained hands-on experience in Python programming via Google's interactive course. (2021)

emoji_events

Best #Top 63 World Rank in HackerEarth Programming

Ranked in the top 63 worldwide on HackerEarth after 3 months of consistent coding. (2021)

article

Problem Solving Certificate by HackerRank

Earned HackerRank's certification by completing the problem-solving challenge. (2020)

Get In Touch

Let's connect and build something impactful.

I'm currently open to new opportunities in Data Science and AI/ML engineering. Whether you have a question or just want to say hi, I'll try my best to get back to you!
Open to full-time roles, internships, and meaningful collaborations in data science and AI/ML

mail

Email

shashankmankala.5@gmail.com

phone

Phone

+1 716 547 1045

location_on

Location

NY, USA (Remote available)

Virtual Assistant
Hello! I'm Shashank's virtual assistant. I can help you navigate his portfolio. What would you like to know?