Raj Krishnamurthy
Senior Data Scientist
// Summary
Senior Data Scientist with a PhD in Statistics and 6+ years of industry experience applying machine learning, NLP, and statistical modeling to solve complex business problems. Published researcher with expertise in deep learning, causal inference, and large-scale data pipelines. Proven ability to translate academic rigor into production-grade ML systems that drive measurable business impact.
// Experience
Senior Data Scientist
Feb 2022 — PresentHelix Health Analytics @ Boston, MA
Lead a team of 4 data scientists building predictive models for patient outcomes, reducing hospital readmission rates by 22% across 15 partner health systems. Developed a clinical NLP pipeline using transformer-based models to extract structured data from 10M+ unstructured medical records with 94% F1 score. Designed and deployed a real-time risk stratification engine on AWS SageMaker, processing 50K+ patient records daily with sub-second latency. Established MLOps best practices including model versioning, automated retraining pipelines, and A/B testing frameworks for production models.
Data Scientist
Jul 2019 — Jan 2022Quanta Financial @ New York, NY
Built machine learning models for credit risk assessment and fraud detection, processing $2B+ in daily transaction data using Python, Spark, and TensorFlow. Developed a time-series forecasting system for portfolio risk that outperformed baseline models by 30%, enabling more accurate capital allocation. Created interactive dashboards and automated reporting pipelines in Tableau and Python, reducing analyst reporting time by 60%. Conducted causal inference analyses using propensity score matching and difference-in-differences to evaluate the impact of policy changes on customer behavior.
Research Data Scientist
Sep 2017 — Jun 2019MIT Lincoln Laboratory @ Lexington, MA
Researched and implemented deep learning models for anomaly detection in large-scale sensor networks, achieving a 15% improvement in detection accuracy. Developed scalable data processing pipelines using Apache Spark and Hadoop to analyze petabyte-scale datasets from distributed sensor arrays. Authored 3 technical reports and contributed to 2 peer-reviewed publications on applications of deep learning for signal processing. Collaborated with interdisciplinary research teams across physics, engineering, and computer science departments.
// Skills
// Languages
English: Native
Tamil: Fluent
Hindi: Advanced
// Certifications
TensorFlow Developer Certificate — Google (Jun 2023)
// Education
Doctor of Philosophy (PhD) in Statistics
Sep 2013 — Aug 2017Massachusetts Institute of Technology, Cambridge, MA
Dissertation: "Bayesian Nonparametric Methods for High-Dimensional Time Series." Advisor: Prof. Elena Marchetti. Published 4 papers in top-tier journals including JASA and Annals of Statistics. NSF Graduate Research Fellowship recipient.
Master of Science in Computer Science
Sep 2011 — May 2013Carnegie Mellon University, Pittsburgh, PA
Concentration in Machine Learning. Coursework in statistical learning theory, probabilistic graphical models, and convex optimization. Research assistant in the Auton Lab working on automated scientific discovery. GPA: 3.9/4.0.
Data Scientist
A research-oriented CV for a senior data scientist with a PhD background, expertise in machine learning and NLP, and peer-reviewed publications.
Sign up free to use this example