Curriculum Vitae
Education #
Ph.D. in Computer Science, University of California, Irvine
📍Irvine, CA
Jun. 2021 - Mar. 2026
Computational Genomics · Interpretable AI · Multi-omic Integration
- Dissertation: From Modality Translation to Regulatory Reasoning: Interpretable AI for Single-Cell Multi-Omic Systems
- Advisor: Prof. Jing Zhang
M.S. in Computational Biology, Carnegie Mellon University
📍Pittsburgh, PA
Aug. 2018 - May. 2020
- Cumulative GPA: 3.85
B.S. in Computer Science with a Specialization of Bioinformatics, University of California, San Diego
📍La Jolla, CA
Sep. 2015 - Jun. 2018
- Cumulative GPA: 3.84 / Major GPA: 3.91
- Advisor: Awards: Provost Honor, Cum Laude
Research #
🏛️Zhang Lab, University of California, Irvine
📍Computational Genomics · Single-Cell Multi-Omics
🗓️Jan. 2025 – Present
•
Aligned spatial transcriptomics, epigenomics, and histology via multimodal representation learning for in situ states.
•
Designed robust cell-type mapping under sparsity and partial observability across tissues and protocols.
•
Trained cross-modality recovery models to denoise and impute gene expression at higher resolution.
🗓️Jan. 2022 – Mar. 2026
•
Modeled opioid–HIV response programs from consortium-scale single-cell profiles in neural and immune cells.
•
Built reproducible QC, batch correction, and harmonization pipelines for cross-site integration.
•
Standardized data assets: curated matrices, metadata schemas, and visualization-ready outputs for collaborators.
•
Coordinated multi-institution analyses and experimental follow-ups to validate computational hypotheses.
🗓️Jan. 2024 – Jun. 2025
•
Integrated snRNA-seq, snATAC-seq, and spatial Xenium across >2M nuclei in cohorts.
•
Built cross-cohort harmonization to mitigate protocol, site, and donor-level confounding effects.
•
Reconstructed cell-type-specific regulatory networks linking stress pathology to gene regulation and chromatin.
•
Delivered analysis-ready atlases, annotations, and figures for consortium publication and reuse.
🗓️Jun. 2023 – Oct. 2024
•
Developed scACT to translate between paired scRNA-seq and scATAC-seq with high fidelity.
•
Designed modality-bridging embeddings robust to sparsity, library-size shifts, and batch effects.
•
Benchmarked against state-of-the-art translators across datasets, cell types, and perturbations.
•
Enabled imputation of missing modalities to support downstream regulatory inference and integration.
🗓️Sep. 2022 – Jun. 2024
•
Integrated multi-cohort brain single-cell epigenomic datasets for atlas-scale regulatory analysis.
•
Built reproducible pipelines for QC, harmonization, and confound-aware cross-study integration.
•
Learned robust chromatin representations to stabilize cell-state inference under sparsity and batch shifts.
•
Produced cell-type annotations and regulatory programs to support consortium figures and publications.
🗓️Jun. 2023 – Feb. 2024
•
Curated genomic and clinical datasets from public repositories; harmonized formats and metadata.
•
Developed ML curriculum and delivered hands-on workshops for biomedical researchers.
•
Coordinated with partner institutions to deploy AI workflows for translational health studies.
🗓️Jun. 2021 – Jul. 2022
•
Introduced Translator, a transfer-learning framework pretraining scATAC-seq models on high-quality references.
•
Learned reusable chromatin embeddings that improved clustering and cell-type annotation under extreme sparsity.
•
Performed domain adaptation to heterogeneous protocols, reducing batch effects and sequencing-depth shifts.
•
Benchmarked across cohorts and tissues; delivered stable peak–gene links and regulatory program discovery
🏛️Carnegie Mellon University
📍Biomedical Machine Learning · Multimodal Modeling
🗓️Jul. 2020 – Jun. 2021
•
Supervisor: Prof. Ziv Bar-Joseph; Collaborator: Astarte Medical
•
Built ML models predicting neonatal growth trajectories from clinical and microbiome time series.
•
Modeled missing and irregular EHR records with Hidden Markov Models and imputation.
•
Suggested feeding interventions via model-based counterfactual analysis and cohort stratification
🗓️Nov. 2018 – May. 2020
•
Supervisor: Prof. Jian Ma
•
Linked histology images and transcriptomes via graph neural networks for joint tissue profiling.
•
Learned shared multimodal embeddings using autoencoders for alignment and cross-modality retrieval.
•
Enabled spatially aware interpretation via feature attribution over learned tissue representations.
🏛️University of California, San Diego
📍Biomedical Databases
🗓️Jun. 2016 – Jun. 2018
•
Supervisors: Prof. Xiaoqian Jiang, Prof. Shuang Wang
•
Mapped clinical variables into OMOP CDM schemas to improve interoperability across databases.
•
Built visualization tools for cohort exploration, missingness auditing, and temporal trend analysis.
•
Contributed to OHDSI efforts on scalable, global health data standardization and reuse.
Work Experience #
🏛️University of California, Irvine
🗓️Sep. 2021 - Mar. 2026
•
Developed interpretable ML for single-cell multi-omics: translation, integration, and regulatory reasoning.
•
Built transfer-learning and generative models for modality prediction under sparsity and batch shift.
•
Collaborated in PsychENCODE, SCORCH, and PTSD/MDD consortia to deliver pipelines and atlases.
•
Co-authored peer-reviewed publications; released reproducible code, datasets, and documentation for reuse.
•
Mentored junior researchers on model design, evaluation, and scientific communication.
🗓️Sep. 2021 - Mar. 2025
•
Head TA for ICS 6D Discrete Mathematics and TA for CS 141 Programming Languages in multiple offerings (247+ students).
•
Designed problem sets, quizzes, and exams; aligned objectives, rubrics, and learning outcomes.
•
Built Gradescope autograding and Q&A workflows to scale feedback for large enrollments.
•
Led sections, review sessions, and office-hour systems across in-person and hybrid delivery.
🏛️Carnegie Mellon University
🗓️Jul. 2020 - Jun. 2021
•
Built predictive models for neonatal growth using longitudinal clinical and microbiome time series.
•
Engineered cohort-aware feature pipelines with missingness modeling and temporal alignment of visits.
•
Applied probabilistic sequence models to handle irregular sampling and sparse EHR data.
•
Evaluated generalization via site-aware splits, calibration, and uncertainty-aware model selection.
•
Produced reproducible training/inference workflows and analysis reports for clinical collaborators.
🗓️Jan. 2020 - May. 2020
•
Led recitations and office hours; supported the rapid transition to remote/hybrid instruction.
•
Developed programming assignments, solutions, and rubrics aligned with course learning objectives.
•
Built autograding and feedback workflows to scale timely, consistent grading for large enrollments.
•
Provided targeted support on debugging, scientific computing, and quantitative modeling fundamentals
🏛️University of California, San Diego
🗓️Mar. 2016 - Mar. 2018
•
Led small-group and one-to-one tutoring in programming, data structures, and data-science fundamentals.
•
Coordinated weekly with instructors to design sections, refine assignments, and target common misconceptions.
•
Built fair, rubric-driven assessments; improved clarity of exam questions and scaled feedback with Gradescope.
•
Provided inclusive, scaffolded support via flexible office hours and high-touch debugging guidance.
Skills #
- Programming: Python, R, Java, C/C++, SQL, Ocaml, Haskell, Prolog, Matlab, WebDev
- AI/Machine Learning Tools: PyTorch, Tensorflow, Conda, Huggingface
- Bioinformatics Tools: Seurat, ArchR, Signac, Pegasus, Squidpy, Cellranger, Samtools, Bowtie, David
- Teaching: Tools Canvas, BlackBoard, Gradescope, Zybooks
- Languages: Chinese (Mandarin), English, Japanese
Services #
Journal Reviewer #
- BioData Mining
- BMC Bioinformatics
- BMC Genomics
- Discover Computing
- Discover Genetics and Evolution
- npj Artificial Intelligence
- Scientific Reports
- The Journal of Supercomputing
- Translational Psychiatry
Conference Reviewer #
- 2024 IEEE International Conference on Bioinformatics and Biomedicine
- 2024-2025 Conference on Information and Knowledge Management
- 2024-2026 International Conference on Intelligent Systems for Molecular Biology
- 2026 ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Program Committee Member #
- 2024-2025 International Conference on Artificial Intelligence for Medicine, Health, and Care
Awards #
- Best Student Paper Award, 2024 Asian Conference on Machine Learning
- Recipient, NSF Student Travel Award
There are no articles to list here yet.