prof_pic.jpg

AI Researcher|

Toronto, Canada

đź“„ Resume

Shirin Shahabi

I am a Machine learning Researcher at Inference Labs, working on distributed verified inference. I have been working on Reinforcement Learning applications for the past 4 years, but currently I am more focused on Inference verification and Post training!

  • Verifiable Inference and Privacy Preserving ML
  • Compression, Optimization and a -bit- Inference Quantization
  • Reinforcement learning and Finetuning
  • Distributed inference

I am always looking to connect with people and discuss potential collaborations(the root how knowledge form and transfer) , Let's have a chat!

Skills

Python R PyTorch TensorFlow Hugging Face Apache Spark MongoDB MySQL PostgreSQL Google Cloud AWS Docker Git Kubernetes

Featured Projects

Inference Labs - Verifiable AI & LLMs

  • Distributed Proving Framework - Contributed as a core researcher to DSperse - open-source framework for distributed, verifiable inference. [paper] [sourcecode]
  • JSTProve Integration - Contributed to JSTProve [paper] development and integration with DSperse, a specialized zkML toolkit built on Polyhedra Network's Expander backend to enable AI developers and ML engineers to generate and verify proofs of AI inference.
  • AI Browser Agent Security - Devised and implemented Prompt Injection detection and zk-proxy systems for AI browser agents, leveraging DSperse masking for secure LLM source verification; instead of Prompt Guardlines.
  • Commercial Aviation Vision Model - Developed distributed, verified inference for an external in-production, commercial Vision aviation model using DSperse and JSTProve preserving Inference fidelity, served as auditing and compliance tool.
  • Quantization and Post-training - Contributed to an open-source ZK toolkit for quantization and post-training methods.
  • Verifiable AI Agent - Developed the first verifiable Doom agent, reducing overhead from 1000x to 8x.
  • Modular Proving Framework - Designed a scalable proof-of-inference framework leveraging agentic testing and probabilistic proofs, with no computational overhead, using a JIT design pattern and caching system.
  • DPO-Based Ranking - Implemented ranking profiles using Reinforcement Learning benchmarking DPO, Group Relative Policy Optimization (GRPO) and Reinforcement Learning Teacher (RLT).

Nobitex - Data Science & Finance

  • Customer Clustering at Scale - Developed scalable, ensemble tree-based algorithm leveraging Spark for processing 4M+ customer transactions, driving MAU growth to 8.5M within one year
  • Chain Fraud Detection Dashboard - Developed the Fraud pipeline discovery and classification in OTLP wallet transactions based on blockchain transaction monitoring
  • Marketing Data Pipeline - Led cross-team collaboration to design an end-to-end Marketing Data Modeling ETL Pipeline with ERD system design

SnappMarket - Retail Tech & Optimization

  • Autonomous Inventory System - Developed scalable inventory reordering system for 15 hypermarkets with +8,000 SKUs
  • Automated Shopping Experience - Contributed to a $100M funded project for a fully automated shopping experience (Low-Cost Amazon Go), implementing indoor location tracking in collaboration with Rocket Internet
  • Operational Analytics - Established real-time operational KPIs with automated reporting systems, ensuring data accuracy and driving strategic decisions

Education

M.Sc. Computer Science

McMaster University

  • Thesis: Stochastic Weight Optimization leveraging hierarchical reinforcement learning in Multi-objective network.
  • Member of DFIC Quant Group - Quantitative Finance Department
  • Participated in RLDL Vector Summer School at Vector Institute

MBA, Finance

Sharif University

Direct admission with academic ranking. Merit-based admission by the Office of Exceptional Talents.

B.Sc. System Engineering

Sharif University

Member of National Elite Foundation (INEF)

news

Oct 28, 2025 JSTprove Paper is Live!
Oct 01, 2025 Accepted to RL Residency at Prime Intellect.
Aug 09, 2025 First DSperse Paper and framwork is out!
May 01, 2025 Promoted to AI Researcher at Inference Labs! Starting May 2025, I will be leading research initiatives in distributed verifiable inference and zero-knowledge machine learning.
Mar 19, 2025 Our paper titled “Enhanced Pareto Optimality with Reinforcement Learning Approach” has been accepted for presentation at the CORS 2025 Conference.

latest posts