New Jersey · Open to senior AI/ML roles

Shipping ML systems at the
edge of finance, risk,
and retrieval.

I'm Sai Nikhil — an AI/ML engineer and data scientist with 5+ years building production ML and Generative AI systems. Currently at Goldman Sachs, building financial research and risk-intelligence platforms — RAG pipelines, agentic LLM workflows, and models for risk, anomaly, and fraud detection.

5+ yrs
building and shipping production ML, Generative AI, and data-science systems across financial services
35%
higher information-retrieval accuracy in production RAG pipelines for financial research
25%
improvement in risk-prediction and fraud-detection model accuracy via tuning + feature work
40%
faster query response across a 100 TB+ analytics warehouse for risk and regulatory reporting
Sai Nikhil Mattapalli — portrait
Fig. 01 Sai Nikhil Mattapalli · NJ · AI/ML Engineer
— 00

Hello — I'm Sai Nikhil.

I build production ML and Generative AI systems for financial services. My day-to-day at Goldman Sachs is shipping financial research and risk-intelligence platforms — RAG pipelines, agentic LLM workflows, and models for risk prediction, anomaly detection, and fraud.

Before Goldman Sachs I spent two and a half years at Cognizant as a Data Scientist — building real-time market models, reinforcement-learning execution strategies, anomaly-detection frameworks, and NLP sentiment pipelines over large-scale financial data. I hold a Master's in Computer Science from SUNY Albany.

My work sits where modeling meets engineering: latency budgets, evaluation that survives production, and pipelines that don't fall over when the data changes underneath them. Offline accuracy is the start of the job — shipping is the rest.

Role
AI/ML Engineer · Goldman Sachs
Based in
New Jersey, USA
Education
M.S. CS · SUNY Albany
Specialties
Financial ML · RAG · MLOps · GenAI
Open to
Senior AI/ML Engineer roles
— 01

Work history

Five-plus years across financial services and enterprise data science — from EDA to production GenAI.

Goldman Sachs

AI/ML Engineer

Building AI systems for financial research and risk intelligence — RAG-based knowledge discovery, agentic LLM workflows, and ML models for risk, anomaly, and fraud detection across large-scale financial data.

  • Built an AI-powered Financial Research & Risk Intelligence Platform with Python, LangChain, Azure OpenAI, and Hugging Face Transformers to automate financial knowledge discovery and analysis.
  • Designed RAG pipelines with LangChain, OpenAI embeddings, and vector databases — 35% higher information-retrieval accuracy for financial research and knowledge discovery.
  • Engineered scalable ETL pipelines with PySpark, SQL, and Apache Spark to process over 2 TB of structured and unstructured financial data.
  • Built ML models for risk prediction, anomaly detection, and document classification (Scikit-learn, TensorFlow) — 25% higher accuracy for risk-assessment and fraud-detection workflows.
  • Accelerated model training and large-scale data processing with JAX, cutting experimentation and training time by 30%.
  • Built multi-step reasoning workflows with LLM agents and tool-calling to automate financial KPI analysis and investment research; fine-tuned HF Transformers for 25% better extraction and classification.
  • Deployed ML and GenAI apps on AWS with Docker, MLflow, and CI/CD, orchestrated with AWS Lambda / Step Functions and Vertex AI Pipelines35% faster model lifecycle.
  • Python
  • LangChain
  • Azure OpenAI
  • HuggingFace
  • PySpark
  • JAX
  • Scikit-learn
  • TensorFlow
  • AWS Lambda
  • MLflow
  • Vertex AI

Cognizant

Data Scientist

Built real-time ML and data-science systems for trading, execution, and risk — market models, RL execution strategies, anomaly detection, and NLP sentiment over large-scale financial data.

  • Developed and deployed a real-time ML model (Python, TensorFlow, scikit-learn) for market-fluctuation analysis over price trends, order-book dynamics, and macro indicators — 15% better predictive accuracy.
  • Engineered an RL-driven execution model (OpenAI Gym, TensorFlow) that adapts trade placement to liquidity and market microstructure — 5% lower transaction costs.
  • Designed an anomaly-detection framework with Apache Kafka and PySpark to flag market manipulation in real time — 30% fewer false positives, aligned to SEC & FINRA standards.
  • Optimized SQL on a 100 TB+ Azure Synapse warehouse (partitioning, distribution, indexing) — 40% faster query response for risk and regulatory reporting.
  • Led statistical experiments (NumPy, SciPy, StatsModels) on new trading signals — 10% better market-making and arbitrage efficiency.
  • Built interactive Tableau dashboards for live trading metrics, risk exposure, and PnL — 25% faster stakeholder decisions.
  • Designed an NLP pipeline (BERT, NLTK) over earnings reports and financial news — 18% higher sentiment-prediction accuracy for risk teams.
  • Python
  • TensorFlow
  • scikit-learn
  • OpenAI Gym
  • Kafka
  • PySpark
  • Azure Synapse
  • BERT
  • NLTK
  • Tableau
— 02

Signature projects

Production ML systems and open-source work where the architecture, the model, and the outcome line up.

P/01
Goldman Sachs·2023–25·Financial ML

Financial Research & Risk Intelligence Platform

End-to-end platform that automates financial knowledge discovery and risk analysis. Python + LangChain + Azure OpenAI + Hugging Face Transformers over a 2 TB+ corpus of structured and unstructured financial data, with Scikit-learn / TensorFlow models for risk prediction, anomaly detection, and document classification.

35%retrieval accuracy
25%risk-model accuracy
2TB+data processed
  • Python
  • LangChain
  • Azure OpenAI
  • HuggingFace
  • Scikit-learn
  • TensorFlow
P/02
Goldman Sachs·2024–25·Generative AI

RAG Financial Knowledge Discovery

Agentic RAG system for financial research — LangChain with OpenAI embeddings over vector databases, plus multi-step LLM reasoning and tool-calling that automate KPI analysis and investment-research workflows. Fine-tuned Transformers sharpen extraction and classification.

35%retrieval accuracy
25%extraction lift
Agentstool-calling
  • LangChain
  • OpenAI
  • Vector DB
  • Agents
  • RAG
  • PEFT
P/03
Cognizant·2020–22·Trading Systems

Real-time Anomaly Detection & RL Execution

Real-time financial systems for execution and surveillance — an RL-driven execution model (OpenAI Gym + TensorFlow) tuned to liquidity and market microstructure, and a Kafka + PySpark anomaly-detection framework flagging market manipulation in real time, aligned to SEC & FINRA standards.

30%fewer false positives
5%lower transaction cost
SECFINRA aligned
  • Apache Kafka
  • PySpark
  • OpenAI Gym
  • TensorFlow
  • Anomaly Detection
— 03

Technical stack

Tools I've shipped with — grouped by the part of the system they serve.

Languages

  • Python
  • SQL
  • R

ML & Deep Learning

  • Scikit-learn
  • TensorFlow / Keras
  • Random Forests
  • SVM
  • Neural Nets
  • K-Means / KNN
  • JAX

Generative AI

  • LangChain
  • OpenAI API
  • Hugging Face
  • Azure OpenAI
  • RAG
  • Prompt Eng
  • Embeddings
  • LoRA / PEFT

Data & Databases

  • Pandas
  • NumPy
  • SciPy
  • PySpark
  • Kafka
  • FAISS / Pinecone
  • PostgreSQL
  • MongoDB

Cloud & Pipelines

  • AWS S3 / EMR / Redshift
  • AWS Lambda
  • Azure Synapse
  • Snowflake
  • BigQuery
  • Apache Airflow
  • Vertex AI
  • Step Functions

MLOps & Viz

  • Docker
  • Kubernetes
  • Jenkins
  • CI/CD
  • MLflow
  • Git
  • Tableau
  • Power BI
— 04

Credentials & focus

Formal education, certifications, and the domains where the work is happening.

Education

M.S.
Computer Science

SUNY Albany · Albany, NY

Graduate coursework in machine learning, data systems, and applied AI.

B.Tech.
Computer Science

Bharath University · India

Domains

  • Financial MLRisk prediction, anomaly & fraud detection
  • GenAIRAG, agentic workflows, prompt engineering
  • MLOpsCI/CD for ML, MLflow, Vertex AI, monitoring
  • BackendServerless, AWS Lambda, Step Functions, REST
  • DataPySpark, Kafka, Snowflake, vector stores
  • AnalyticsTableau, Power BI, EDA, hypothesis testing
— 05

Let's build
something that matters.

Open to senior AI/ML and Generative-AI roles. Happy to consult on financial ML, RAG architecture, or productionizing LLM systems.