CV
Education
-
2023 - 2026 Atlanta, GA
B.S. in Computer Science
Georgia Institute of Technology
-
GPA: 4.00 Expected Graduation: May 2026 - Concentration: Machine Learning and Data Systems
- Relevant Coursework: Machine Learning, Databases, Computer Systems and Networks, Data Structures and Algorithms
-
Experience
-
2025 - 2025 New York, NY
Software Engineer Intern
Two Sigma Investments
- Designed key performance metrics through collaboration with the product team that provide traders actionable feedback on their ideas, driving deeper platform engagement and higher-quality work.
- Engineered Python library to perform automated daily metric calculations and large-scale backfills.
- Built and deployed ETL pipeline using AWS Lambda and Step Functions to process data in Amazon RDS with PostgreSQL, including Datadog monitoring, unit tests, and full documentation.
-
2025 - present Atlanta, GA
Research Assistant
Georgia Tech — EI & HCAI Lab
- Advised by Dr. Mark Riedl.
- Researching explainable AI for multi-step agentic AI systems, enabling users to understand why tasks are delegated to specific models and supporting meaningful human oversight of automated pipelines; paper accepted at HCXAI Workshop at CHI 2026.
- Developing counterfactual explanation methods for agentic workflows that provide users with actionable insights into how workflow outcomes could have differed under alternative conditions, helping identify failure modes and build appropriate trust.
- Investigating machine unlearning methods to identify how training data composition correlates with language model performance on specific task domains, with the goal of enabling targeted capability control.
-
2024 - present Atlanta, GA
Research Assistant
Georgia Tech Research Institute (GTRI) — ARCAID Lab
- Advised by Dr. Clayton Kerce and Dr. Pat Langley.
- Devising search algorithms that utilize AI agents to optimize scientific claim decomposition and verification, improving factual accuracy assessment in complex reasoning settings.
- Researching techniques for learning generalized solutions to planning problems via hierarchical problem networks, aiming to reduce search complexity and enable more efficient automated reasoning.
- Built a RAG-enhanced chatbot over a corpus of research papers and a code generation pipeline for creating, executing, and debugging Python simulations, empowering researchers to rapidly onboard to specialized topics and test ideas experimentally.
-
2024 - present Atlanta, GA
Research Assistant
Georgia Tech — Financial Services Innovation Lab (FinTech Lab)
- Advised by Dr. Sudheer Chava.
- Designed BELLA, a system to profile LLMs’ strengths and weaknesses and recommend the optimal model for a task based on budget constraints and necessary skills; presented as first-author paper (poster) at MLSys Young Professionals Symposium 2025.
- Co-created FLaME, a holistic finance benchmark for evaluating language models across sentiment analysis, numerical reasoning, document understanding, and question answering; co-first author paper accepted at ACL Findings 2025.
- Benchmarked LLM prompting methods for financial sentiment classification, achieving 10% relative improvement over baselines via automated prompt optimization.
-
2024 - 2024 Cedar Rapids, IA
Machine Learning Engineering Intern
Raytheon Technologies (RTX)
- Developed a production-approved Python framework to integrate explainable AI into existing ML workflows, enabling post-hoc interpretability for deployed models.
- Integrated MLflow to enhance experiment tracking and reproducibility in MLOps processes, saving 50+ hours/month in manual logging overhead.
- Improved object detection throughput on a constrained security system by 5x via multithreading and pipeline optimization.
-
2023 - 2023 San Diego, CA
Bioinformatics Research Assistant
Palmer Lab, UC San Diego
- Optimized ML algorithm for phenotype trait prediction based on genetic data through feature reduction with Python & R, reducing input features from 7.3M to 50k with minimal (< 0.01%) performance degradation.
- Co-authored 2 peer-reviewed journal articles on genetic prediction (JARO and bioRxiv).
- Performed GWAS pipeline preparation and data quality control for large-scale genomics datasets.
Publications
-
2026 Explainable Model Routing for Agentic Workflows
HCXAI Workshop at CHI 2026
Okamoto, M., Erol, A., & Riedl, M.
-
2026 Counterfactual Explanations for Agentic Workflows
HCXAI Workshop at CHI 2026
Singh†, M., Kim†, G., Okamoto†, M., et al. († equal contribution)
-
2025 FLaME: Holistic Finance Language Model Evaluation
ACL Findings 2025
Matlin†, G., Okamoto†, M., Pardawala†, H., Yang, Y., & Chava, S. († equal contribution)
-
2025 Trust by Design: Skill Profiles for Transparent, Cost-Aware LLM Routing (BELLA)
MLSys Young Professionals Symposium (YPS) 2025
Okamoto, M., Erol, A., & Matlin, G.
-
2025 Genome-Wide Association Study of Age-Related Hearing Loss in CFW Mice
Journal of the Association for Research in Otolaryngology (JARO)
Polesskaya, O., Boussaty, E., Cheng, R., et al.
-
2023 RATTACA: Genetic Predictions in Heterogeneous Stock Rats
bioRxiv
Johnson, B. B., Sanches, T. M., Okamoto, M. H., et al.
Service
- Reviewer, ACL Rolling Review (ARR) — January 2026
Skills
Programming Languages: Python, Java, SQL, R, Bash, C++, HTML, CSS, JavaScript, Golang
Machine Learning & AI: PyTorch, TensorFlow, scikit-learn, Pandas, NumPy, OpenCV, Explainable AI, LLMs, NLP, Automated Reasoning, Machine Unlearning
Cloud & Infrastructure: Amazon Web Services (AWS), Docker, Git, GitHub, Datadog, Jira, Confluence
Full Stack & Data Engineering: ReactJS, Flask, Gradle, REST APIs, Firebase, PostgreSQL, MySQL, SQLite, Microsoft SQL Server