cv
Basics
Name | Shirin Shahabi |
Phone | 365-883-4747 |
shirin.shahabinejad@gmail.com |
Work
-
2024.05 - Present Hamilton, Canada
Machine Learning Engineer
- Led the development of verifiable LLM on Qwen 2.5 with no overhead of computation and designed Automated testing using Multi Context Protocol (MCP) sampling.
- Led the Quantization and post-training team and Infrastructure on Open-source LLM Models.
- Developed the first Verifiable Agentic AI with proof of inference, enabling verifiable model predictions to meet legal and insurance liability requirements.
- Engineered a comprehensive automated SR&ED auditing documentation pipeline using Google's Flan T5, seamlessly integrating GitHub, Slack messages, and Jira to streamline employee documentation.
- Designed and implemented a ranking system based on DeepFM and BERT, utilizing Reinforcement Learning with Human Feedback (RLHF) and Direct Preference Optimization (DPO), containerized with Docker.
- Developed Autonomous Index Agent with optimizing liquidity and slippage.
- Improved inference time by 23% by developing a Commit-Reveal strategy on Kubernetes using GCP instances.
- Developed Cosmos SDK modules for decentralized AI inference platforms.
- Developed a Retrieval-Augmented Generation (RAG) system using LLaMA 3 for the R&D team specializing in Zero-Knowledge Cryptography deployed on Graph VertexAI.
- Developed Verifiable Ray-tracing physical simulation based on Nvidia Warp, Google DeepMind Mujoco, and Autodesk XLB frameworks.
- Applied structured pruning and quantization techniques on time series prediction models using NVIDIA's pruning methods for 8-billion parameter LLaMA and ResNet models without loss of accuracy.
-
2022.12 - 2024.09 Hamilton, Canada
Machine Learning Developer
- Developed an Trading Bot, with enhanced risk mitigation strategies to prevent market crashes, leveraging FastDQN reinforcement learning to outperform human traders by 185%.
- Implemented multi-objective bi-strategy West America's rail-highway transportation optimization on high-performance computing cluster using Gurobi multi-threading and shell scripting focusing on class-based Supply Chain Management.
- Implemented a multi-objective bi-strategy optimization model using hierarchical reinforcement learning, improving conventional Multi-Armed Bandit problems by 13% with Meta's PEARL framework.
- Instructed 'Statistical Data Analysis in Healthcare' using Python (EHealth 705), with materials available at: github.com/shirin-shahabi/PythonTutorial_Ehealth705.
-
2022.02 - 2023.01 Remote
Data Scientist
- Developed a scalable, robust, and interpretable ensemble tree-based algorithm for clustering over 4 million customers by leveraging Spark for distributed transaction data processing, resulting in increasing MAU from 4M to 8.5M in under a year.
- Led cross-team collaboration with the Product Team to design an end-to-end Marketing Data Modeling and ETL Pipeline using Django and ORM Design.
- Defined +50 financial KPIs and identified company priorities and targeted measures for strategic growth.
- Utilized Web Engage for Customer Journey Mapping, achieving a 35% increase in conversion rate.
- Conducted market research and customer survey analysis, driving a 10% increase in user engagement.
- Collaborated on Power BI and EDA Course for Managers, promoting dashboard adoption for data-driven decision-making.
- Conducted Instagram and Twitter sentiment analysis, extracting key insights to inform marketing strategies, resulting in a 30% increase in retention rate.
-
2022.01 - 2022.02 Tehran, Iran
Financial Analyst
- Designed a comprehensive business plan for a local +12 million dollar hotel construction project.
- Conducted cash flow projections and performed a feasibility study.
-
2020.10 - 2021.12 Remote, Canada
Data Analyst
- Contributed to a visionary product with over $100 million in funding for a fully automated shopping experience (Low-Cost Amazon Go), emphasizing indoor location tracking, in collaboration with Rocket Internet.
- Optimized inventory reordering for 15 hypermarkets with a fully automated algorithm for +8,000 SKUs.
- Collaborated on loyalty, fraud detection, retention, targeted marketing, and churn prediction programs.
- Mentored two interns in the Data Team, now successful Data Scientists one pursuing PhD in University of Michigan and the other doing Master of Computer Science at Waterloo University.
- Delivered +100 optimized SQL queries, benefiting Marketing, Operations, and Finance teams.
- Established real-time operational KPIs with automated email reports for timely insights, ensuring data accuracy.
- Defined packaging strategies and presented enhanced user experiences, resulting in a direct 10% increase in revenue.
Education
Skills
Programming Languages | |
Python | |
R | |
SQL (MySQL, Postgres, MongoDB) |
Cloud Platforms | |
Google Cloud Platform (Kubernetes, Vertex AI Engine) | |
AWS (Lambda, EC2, SageMaker) |
Machine Learning Frameworks | |
TensorFlow | |
PyTorch | |
Scikit-learn | |
Hugging Face |
AI and NLP Tools | |
LangChain | |
NLTK | |
SpaCy | |
ONNX |
MLOps and DevOps Tools | |
Docker | |
Git | |
CI/CD (Argo workflow, Redis, Spark) |
Web and API Technologies | |
REST APIs | |
Streamlit | |
Shiny App |
Data Engineering | |
ETL Pipeline Development | |
Data Modeling | |
Data Streaming | |
Distributed Computing |
Expertise | |
Reinforcement Learning | |
Large Language Models (LLMs) | |
Retrieval-Augmented Generation (RAG) | |
Natural Language Processing (NLP) | |
Inference Engineering | |
Compression Techniques | |
Pre-training and Fine-Tuning | |
Predictive Modeling | |
Statistical Analysis | |
Fraud Detection | |
Verifiable Inference | |
Agentic AI |
Operation Research | |
Mathematical Optimization | |
Integer Programming | |
Stochastic Modeling | |
Discrete-event Simulation | |
Meta-heuristic Search Algorithms | |
Process Mining |