prof_pic.jpg

AI Researcher|

Toronto, Canada

πŸ“„ Resume

Shirin Shahabi

I am a machine learning researcher at Inference Labs, focused on distributed verified inference, reinforcement learning, and post-training systems.

  • Verifiable Inference and Privacy Preserving ML
  • Compression, Optimization and a -bit- Inference Quantization
  • Reinforcement learning and Finetuning
  • Distributed inference
  • LLM and Agent Evaluation

I am always looking to connect with people and discuss ideas, collaboration, and how knowledge is built and transferred.

Skills

Python R PyTorch TensorFlow Hugging Face Apache Spark MongoDB MySQL PostgreSQL Google Cloud AWS vLLM SGLang Docker Git Kubernetes

~/interests

β”œβ”€β”€ 2027?/  Local Inference, RL simulators, edge &amp on-device verification
β”œβ”€β”€ 2026/  inference serving,  agent security protocols, scalable reliable agentic system, diffusion
β”œβ”€β”€ 2025/   verification, RL, Posttraining, World models
β–Έ earlier branches (2007–2024)
β”œβ”€β”€ 2024/   Blockchain, Gen AI and Agentic risk assessment
β”œβ”€β”€ 2023/   cryptography, autonomous driving, Hierarchical RL simulation
β”œβ”€β”€ 2022/   Data Pipeline and Blockchain Eng, crypto services(L2)
β”œβ”€β”€ 2021/   reinforcement learning, Network and Graphs
β”œβ”€β”€ 2020/   first real job in data science, Indoor location, Platform Data science
β”œβ”€β”€ 2019/   networks, supply chain, drone transport
β”œβ”€β”€ 2018/   business & system design, platforms
β”œβ”€β”€ 2017/   auto trading, crypto
β”œβ”€β”€ 2016/   robotics
β”œβ”€β”€ 2015/   math olympiad
β”œβ”€β”€ 2011/   sci-fi novels, books, literature
β”œβ”€β”€ 2010/   aerospace & expeditions
β”œβ”€β”€ 2009/   playing chess
β”œβ”€β”€ 2008/   computers
└── 2007/   dinosaurs :)

news

Feb 02, 2026 Developed ZKProxy β€” a universal verification protocol that wraps any guardrail engine.
Jan 20, 2026 Published our SOTA LLM agentic evaluation framework β€” TruthTensor is live!
Dec 12, 2025 DSperse is now patented β€” distributed verifiable inference!
Nov 02, 2025 Reached real-time video detection β€” YOLO-based sports detection verification and edge-device Tesla detection verification.
Oct 28, 2025 JSTprove Paper is Live!
Oct 01, 2025 Accepted to RL Residency at Prime Intellect.
Aug 09, 2025 First DSperse Paper and framwork is out!
May 01, 2025 Promoted to AI Researcher at Inference Labs! Starting May 2025, I will be leading research initiatives in distributed verifiable inference and zero-knowledge machine learning.
Mar 19, 2025 Our paper titled β€œEnhanced Pareto Optimality with Reinforcement Learning Approach” has been accepted for presentation at the CORS 2025 Conference.
Dec 20, 2024 I successfully defended my Master’s thesis at McMaster University, focusing on β€œMulti-Objective Bi-Level Hierarchical Reinforcement Learning.” This work enhances traditional stochastic optimization and multi-armed bandit approaches to offer interpretable decision-making frameworks.
Jul 01, 2024 I participated in the Deep Learning and Reinforcement Learning (DLRL) 2024 summer school at Vector Institute.
Apr 01, 2024 I have started my Full-time position at Inference Labs to work on Verifiable Inference.
Sep 01, 2022 I was accepted to McMaster University under the supervision of Prof. Manish Verma with an Excellence Scholarship.

latest posts