I recently graduated with a Master's degree in Software Engineering from the School of Computer Science at Carnegie Mellon University and joined AWS as a Software Development Engineer. Before that, I earned my Bachelor's degree in Software Engineering from Tongji University, during which I interned at Shanghai AI Lab and ByteDance AI Lab.
My software engineering background has given me a strong engineering foundation. Building on it, my primary experience and interests include, but are not limited to:
You can also refer to the projects page for details. Additionally, I am currently keen to explore the fields of LLMs and MLSys.
I am attending EMNLP in Suzhou, China and NeurIPS in San Diego in November and December 2025, respectively. If you are also attending, feel free to reach out; I am open to discussing a wide range of topics, including potential collaboration opportunities.
") does not match the recommended repository name for your site ("").
", so that your site can be accessed directly at "http://".
However, if the current repository name is intended, you can ignore this message by removing "{% include widgets/debug_repo_name.html %}" in index.html.
",
which does not match the baseurl ("") configured in _config.yml.
baseurl in _config.yml to "".

Anjiang Wei, Huanmi Tan, Tarun Suresh, Daniel Mendoza, Thiago SFX Teixeira, Ke Wang, Caroline Trippel, Alex Aiken
NeurIPS 2025 Fourth Workshop on Deep Learning for Code
VeriCoder is a model for RTL (Register Transfer Level) code generation, fine-tuned on a novel dataset that is functionally validated via feedback-directed refinement. Unlike prior datasets that only ensure syntactic correctness, our dataset guarantees that each RTL design passes automatically generated unit tests aligned with its natural language specification. Our key contributions include: (1) a large-scale dataset of 125,000+ examples with simulation-passing RTL designs, (2) a feedback-driven construction methodology that iteratively refines designs and tests based on test results, (3) superior performance with up to +71.7% relative improvement on VerilogEval benchmarks, and (4) comprehensive resources including dataset, model weights, inference scripts, and training pipeline.
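A minimal Python sketch of the feedback-directed refinement loop described above, under stated assumptions: the helper names (generate_rtl, generate_tests, simulate, refine) are hypothetical placeholders standing in for LLM calls and a Verilog simulator, not the paper's actual API.

```python
# Sketch of feedback-directed dataset construction: keep an example only if
# its RTL design passes automatically generated unit tests in simulation.
from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class SimResult:
    passed: bool  # did the design pass all generated unit tests?
    log: str      # simulator output, fed back to the model on failure

def build_validated_example(
    spec: str,
    generate_rtl: Callable[[str], str],      # hypothetical LLM call
    generate_tests: Callable[[str], str],    # hypothetical LLM call
    simulate: Callable[[str, str], SimResult],  # hypothetical simulator wrapper
    refine: Callable[[str, str, str], str],  # hypothetical refinement step
    max_rounds: int = 3,
) -> Optional[dict]:
    """Return a (spec, design, tests) triple only if simulation passes."""
    design = generate_rtl(spec)
    tests = generate_tests(spec)
    for _ in range(max_rounds):
        result = simulate(design, tests)
        if result.passed:
            # only simulation-passing designs enter the dataset
            return {"spec": spec, "design": design, "tests": tests}
        # feed the failure log back and refine the design
        design = refine(spec, design, result.log)
    return None  # drop examples that never pass within the budget
```

The key point the sketch illustrates is that functional validation, not just syntactic checking, gates what ends up in the fine-tuning dataset.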
Anjiang Wei, Yuheng Wu, Yingjia Wan, Tarun Suresh, Huanmi Tan, Zhanke Zhou, Sanmi Koyejo, Ke Wang, Alex Aiken
EMNLP Main 2025
We introduce SATBench, a benchmark for evaluating the logical reasoning capabilities of large language models (LLMs) through logical puzzles derived from Boolean satisfiability (SAT) problems. Unlike prior work that focuses on inference rule-based reasoning, which often involves deducing conclusions from a set of premises, our approach leverages the search-based nature of SAT problems, where the objective is to find a solution that fulfills a specified set of logical constraints. Each instance in SATBench is generated from a SAT formula, then translated into a story context and conditions using LLMs. The generation process is fully automated and allows for adjustable difficulty by varying the number of clauses. All 2100 puzzles are validated through both LLM-assisted and solver-based consistency checks, with human validation on a subset. Experimental results show that even the strongest model, o4-mini, achieves only 65.0% accuracy on hard UNSAT problems, close to the random baseline of 50%. SATBench exposes fundamental limitations in the search-based logical reasoning abilities of current LLMs and provides a scalable testbed for future research in logical reasoning.
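A minimal sketch of the SATBench-style generation idea: sample a random 3-CNF formula whose difficulty is tuned by the clause count, label it SAT/UNSAT with a solver, and hand the labeled clauses to an LLM for translation into a story puzzle. The LLM translation step is elided; the python-sat (pysat) package is assumed available, and the sampling scheme here is an illustrative assumption rather than the paper's exact procedure.

```python
import random
from pysat.solvers import Glucose3  # pip install python-sat

def random_3cnf(num_vars: int, num_clauses: int, seed: int = 0):
    """Return a list of 3-literal clauses in DIMACS-style integer encoding."""
    rng = random.Random(seed)
    clauses = []
    for _ in range(num_clauses):
        picked = rng.sample(range(1, num_vars + 1), 3)
        clauses.append([v if rng.random() < 0.5 else -v for v in picked])
    return clauses

def label_formula(clauses) -> str:
    """Solver-based ground truth: is the formula satisfiable?"""
    with Glucose3(bootstrap_with=clauses) as solver:
        return "SAT" if solver.solve() else "UNSAT"

# Difficulty is adjusted by the number of clauses (clause/variable ratio);
# each labeled formula would then be turned into a story context by an LLM.
clauses = random_3cnf(num_vars=10, num_clauses=45)
print(label_formula(clauses), clauses[:3])
```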
Anjiang Wei, Tarun Suresh, Huanmi Tan, Yinglun Xu, Gagandeep Singh, Ke Wang, Alex Aiken
NeurIPS 2025 Fourth Workshop on Deep Learning for Code
Large language models (LLMs) have demonstrated strong performance across a wide range of programming tasks, yet their potential for code optimization remains underexplored. This work investigates whether LLMs can optimize the performance of assembly code, where fine-grained control over execution enables improvements that are difficult to express in high-level languages. We present a reinforcement learning framework that trains LLMs using Proximal Policy Optimization (PPO), guided by a reward function that considers both functional correctness, validated through test cases, and execution performance relative to the industry-standard compiler gcc -O3. To support this study, we introduce a benchmark of 8,072 real-world programs. Our model, Qwen2.5-Coder-7B-PPO, achieves 96.0% test pass rates and an average speedup of 1.47x over the gcc -O3 baseline, outperforming all 20 other models evaluated, including Claude-3.7-sonnet. These results indicate that reinforcement learning can unlock the potential of LLMs to serve as effective optimizers for assembly code performance.
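A minimal sketch of the kind of reward signal described above, assuming a candidate assembly program is first checked against unit tests and then timed against the gcc -O3 baseline. The gating and shaping shown here are illustrative assumptions, not the paper's exact formulation.

```python
def reward(tests_passed: bool, baseline_time: float, candidate_time: float) -> float:
    """Reward is 0 for incorrect code; otherwise the speedup over gcc -O3."""
    if not tests_passed:
        return 0.0  # functional correctness is a hard gate
    # speedup > 1.0 means the candidate beats the -O3 baseline
    return baseline_time / max(candidate_time, 1e-9)

# Example: a correct candidate running in 0.68s vs a 1.0s -O3 baseline
print(reward(True, baseline_time=1.0, candidate_time=0.68))  # ~1.47x speedup
```

This reward would then drive PPO updates, so the policy is pushed toward code that is both correct and faster than the compiler baseline.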