Turning raw data into actionable intelligence.
Hi, I'm Shashank. I specialize in building large-scale data pipelines to AI-driven decision systems, I bridge data science, machine learning, and real-world impact.
"Data is the new oil. It's valuable, but if unrefined it cannot really be used. My job is to refine it."
Turning data, models, and systems into real-world impact.
I am a Data Scientist and Machine Learning Engineer with 3+ years of experience building end-to-end data and AI systems that move beyond experimentation into production. My background spans applied statistics, scalable data engineering, and modern machine learning, with a strong focus on delivering measurable business outcomes.
I have designed and deployed forecasting, analytics, and automation pipelines using Snowflake, Azure, and cloud-native tooling, driving up to 60 percent efficiency gains and 25 percent accuracy improvements across large-scale, real-world datasets. I enjoy working at the intersection of data, engineering, and stakeholders to turn complex problems into reliable systems.
Currently, I specialize in Generative AI, LLMs, and MLOps, building high-performance inference engines, RAG-based applications, and OpenAI-compatible APIs with streaming, observability, and production-grade constraints. I believe strong models matter, but robust systems, scalability, and trust matter more.
Outside of work, I contribute to open-source projects, experiment with new AI architectures, and write about applied machine learning, system design, and responsible AI practices.
Featured Projects
Production-Grade Video Summarization and Q&A Platform using Whisper, and Distributed Workers
Scalable asynchronous pipeline for video summarization and Q&A using Whisper, FastAPI, and Redis.
High-Performance LLM Inference Engine with Grammar-Constrained Decoding
Built a local LLM inference engine implementing and enabling deterministic and grammar-constrained text generation through token masking and incremental parsing.
Technical Proficiency
Building production-grade data and AI systems across machine learning, large language models, data engineering, and cloud-native MLOps.
Languages
Machine Learning & AI
Data Engineering
MLOps
Cloud Platforms
Analytics & Visualization
Database Systems
Experimental Design
Professional Experience
A track record of delivering high-impact solutions across various industries.
Data Scientist | Supply Chain | Demand Forecasting
Sep 2025 - Dec 2025Built and deployed an end-to-end demand forecasting system using Snowflake and Python, translating complex supply chain data into scalable, production-ready forecasting solutions. Focused on accuracy, automation, and stakeholder usability across large SKU portfolios.
- Designed and deployed a demand forecasting pipeline with feature engineering and LightGBM, serving forecasts via Azure-backed infrastructure and Streamlit dashboards for planners.
- Automated vendor capacity and purchase buy planning through allocation logic and retraining pipelines, eliminating manual Excel workflows and improving forecast scalability and auditability.
- Collaborated with supply chain, sourcing, and engineering teams to design data schemas, implement ETL pipelines, validate models, and deliver stakeholder-ready documentation.
Lead Data Analyst | Supply Chain | Last Mile
Dec 2023 - Jul 2024Led data-driven cost, productivity, and performance optimization initiatives across regional operations, partnering closely with operations and leadership teams to translate analytics into measurable business outcomes.
- Achieved 98% accuracy in employee database cleansing and validation, establishing a new company-wide benchmark for data quality and reporting reliability.
- Improved regional productivity by 60% through analytics-backed manpower allocation strategies and performance optimization across delivery hubs.
- Designed and delivered hub-level performance dashboards, enabling leadership to identify bottlenecks, track KPIs, and drive data-informed operational decisions.
Software Engineer Intern | FinTech | EdTech
Jun 2022 - Jun 2023Contributed to enterprise-scale software delivery for ONELERN and NPCI, working across automation, testing, and deployment workflows in regulated and high-reliability environments. Gained strong foundations in system quality, performance, and production readiness.
- Designed and implemented automation frameworks across four large-scale projects, reducing manual QA effort by 30 percent and improving release reliability.
- Collaborated with cross-functional teams on the NPCI Central Bank Digital Currency (CBDC) platform, supporting deployment workflows, performance monitoring, and production readiness.
- Contributed to test strategy improvements through data-driven analysis, reducing testing cycle times and improving feedback loops across multiple enterprise systems.
Education
M.S. in Data Science
University at Buffalo (SUNY)
Specialization in Machine Learning and Artificial Intelligence.
Interned at New Era Cap as an Data Scientist.
GPA: 3.9/4.0
BTech. in Computer Science and Engineering
Lovely Professional University
Engineering Major: Data Science.
GPA: 3.8/4.0
Achievements & Certifications
Data Analytics Essentials by Cisco
Completed with certification via Microsoft & LinkedIn Learning. (2023)
Python and SQL for Data Science
Earned certification in Python and SQL for Data Science through Scaler Academy. (2023)
Career Essentials in Data Analysis by Microsoft and LinkedIn Learning
Completed the data analysis track with certification from Microsoft and LinkedIn. (2023)
Top 50 SQL By Leetcode
Earned a badge for solving the top 50 SQL interview challenges on LeetCode. (2023)
Advanced SQL Certificate by HackerRank
Cleared the Advanced SQL assessment and received HackerRank recognition. (2023)
MongoDB M001
Completed the M001 course and received certification from MongoDB University. (2022)
GFG DSA self paced course
Successfully completed and certified by GeeksforGeeks' for the DSA self-paced program. (2021)
Codechef Snackdown 2021
Qualified for the CodeChef SnackDown Round-1A in a global programming event. (2021)
JUMPSTART Finale Contestant
PreFinalist among 150+ students in a national hackathon by Publicis Sapient. (2021)
Modern Big Data Analysis with SQL
Specialization Certificate
in Big Data SQL (2021)
Course 1: Foundations for Big Data Analysis with SQL
Course 2: Analyzing Big Data with SQL
Course 3: Managing Big Data in Clusters and Cloud Storage.
Crash Course on Python by Google
Gained hands-on experience in Python programming via Google's interactive course. (2021)
Best #Top 63 World Rank in HackerEarth Programming
Ranked in the top 63 worldwide on HackerEarth after 3 months of consistent coding. (2021)
Problem Solving Certificate by HackerRank
Earned HackerRank's certification by completing the problem-solving challenge. (2020)
Let's connect and build something impactful.
I'm currently open to new opportunities in Data Science and AI/ML engineering. Whether
you have a question or just want to say hi, I'll try my best to get back to you!
Open to full-time roles, internships, and meaningful collaborations in data science and
AI/ML
shashankmankala.5@gmail.com
Phone
+1 716 547 1045
Location
NY, USA (Remote available)