Skip to main content

Curriculum Vitae


Education
#

Ph.D. in Computer Science, University of California, Irvine
📍Irvine, CA
Jun. 2021 - Mar. 2026
Computational Genomics · Interpretable AI · Multi-omic Integration
  • Dissertation: From Modality Translation to Regulatory Reasoning: Interpretable AI for Single-Cell Multi-Omic Systems
  • Advisor: Prof. Jing Zhang
M.S. in Computational Biology, Carnegie Mellon University
📍Pittsburgh, PA
Aug. 2018 - May. 2020
  • Cumulative GPA: 3.85
B.S. in Computer Science with a Specialization of Bioinformatics, University of California, San Diego
📍La Jolla, CA
Sep. 2015 - Jun. 2018
  • Cumulative GPA: 3.84 / Major GPA: 3.91
  • Advisor: Awards: Provost Honor, Cum Laude

Research
#

🏛️Zhang Lab, University of California, Irvine
📍Computational Genomics · Single-Cell Multi-Omics
🗓️Jan. 2025 – Present
Aligned spatial transcriptomics, epigenomics, and histology via multimodal representation learning for in situ states.
Designed robust cell-type mapping under sparsity and partial observability across tissues and protocols.
Trained cross-modality recovery models to denoise and impute gene expression at higher resolution.
🗓️Jan. 2022 – Mar. 2026
Modeled opioid–HIV response programs from consortium-scale single-cell profiles in neural and immune cells.
Built reproducible QC, batch correction, and harmonization pipelines for cross-site integration.
Standardized data assets: curated matrices, metadata schemas, and visualization-ready outputs for collaborators.
Coordinated multi-institution analyses and experimental follow-ups to validate computational hypotheses.
🗓️Jan. 2024 – Jun. 2025
Integrated snRNA-seq, snATAC-seq, and spatial Xenium across >2M nuclei in cohorts.
Built cross-cohort harmonization to mitigate protocol, site, and donor-level confounding effects.
Reconstructed cell-type-specific regulatory networks linking stress pathology to gene regulation and chromatin.
Delivered analysis-ready atlases, annotations, and figures for consortium publication and reuse.
🗓️Jun. 2023 – Oct. 2024
Developed scACT to translate between paired scRNA-seq and scATAC-seq with high fidelity.
Designed modality-bridging embeddings robust to sparsity, library-size shifts, and batch effects.
Benchmarked against state-of-the-art translators across datasets, cell types, and perturbations.
Enabled imputation of missing modalities to support downstream regulatory inference and integration.
🗓️Sep. 2022 – Jun. 2024
Integrated multi-cohort brain single-cell epigenomic datasets for atlas-scale regulatory analysis.
Built reproducible pipelines for QC, harmonization, and confound-aware cross-study integration.
Learned robust chromatin representations to stabilize cell-state inference under sparsity and batch shifts.
Produced cell-type annotations and regulatory programs to support consortium figures and publications.
🗓️Jun. 2023 – Feb. 2024
Curated genomic and clinical datasets from public repositories; harmonized formats and metadata.
Developed ML curriculum and delivered hands-on workshops for biomedical researchers.
Coordinated with partner institutions to deploy AI workflows for translational health studies.
🗓️Jun. 2021 – Jul. 2022
Introduced Translator, a transfer-learning framework pretraining scATAC-seq models on high-quality references.
Learned reusable chromatin embeddings that improved clustering and cell-type annotation under extreme sparsity.
Performed domain adaptation to heterogeneous protocols, reducing batch effects and sequencing-depth shifts.
Benchmarked across cohorts and tissues; delivered stable peak–gene links and regulatory program discovery
🏛️Carnegie Mellon University
📍Biomedical Machine Learning · Multimodal Modeling
🗓️Jul. 2020 – Jun. 2021
Supervisor: Prof. Ziv Bar-Joseph; Collaborator: Astarte Medical
Built ML models predicting neonatal growth trajectories from clinical and microbiome time series.
Modeled missing and irregular EHR records with Hidden Markov Models and imputation.
Suggested feeding interventions via model-based counterfactual analysis and cohort stratification
🗓️Nov. 2018 – May. 2020
Supervisor: Prof. Jian Ma
Linked histology images and transcriptomes via graph neural networks for joint tissue profiling.
Learned shared multimodal embeddings using autoencoders for alignment and cross-modality retrieval.
Enabled spatially aware interpretation via feature attribution over learned tissue representations.
🏛️University of California, San Diego
📍Biomedical Databases
🗓️Jun. 2016 – Jun. 2018
Supervisors: Prof. Xiaoqian Jiang, Prof. Shuang Wang
Mapped clinical variables into OMOP CDM schemas to improve interoperability across databases.
Built visualization tools for cohort exploration, missingness auditing, and temporal trend analysis.
Contributed to OHDSI efforts on scalable, global health data standardization and reuse.

Work Experience
#

🏛️University of California, Irvine
🗓️Sep. 2021 - Mar. 2026
Developed interpretable ML for single-cell multi-omics: translation, integration, and regulatory reasoning.
Built transfer-learning and generative models for modality prediction under sparsity and batch shift.
Collaborated in PsychENCODE, SCORCH, and PTSD/MDD consortia to deliver pipelines and atlases.
Co-authored peer-reviewed publications; released reproducible code, datasets, and documentation for reuse.
Mentored junior researchers on model design, evaluation, and scientific communication.
🗓️Sep. 2021 - Mar. 2025
Head TA for ICS 6D Discrete Mathematics and TA for CS 141 Programming Languages in multiple offerings (247+ students).
Designed problem sets, quizzes, and exams; aligned objectives, rubrics, and learning outcomes.
Built Gradescope autograding and Q&A workflows to scale feedback for large enrollments.
Led sections, review sessions, and office-hour systems across in-person and hybrid delivery.
🏛️Carnegie Mellon University
🗓️Jul. 2020 - Jun. 2021
Built predictive models for neonatal growth using longitudinal clinical and microbiome time series.
Engineered cohort-aware feature pipelines with missingness modeling and temporal alignment of visits.
Applied probabilistic sequence models to handle irregular sampling and sparse EHR data.
Evaluated generalization via site-aware splits, calibration, and uncertainty-aware model selection.
Produced reproducible training/inference workflows and analysis reports for clinical collaborators.
🗓️Jan. 2020 - May. 2020
Led recitations and office hours; supported the rapid transition to remote/hybrid instruction.
Developed programming assignments, solutions, and rubrics aligned with course learning objectives.
Built autograding and feedback workflows to scale timely, consistent grading for large enrollments.
Provided targeted support on debugging, scientific computing, and quantitative modeling fundamentals
🏛️University of California, San Diego
🗓️Mar. 2016 - Mar. 2018
Led small-group and one-to-one tutoring in programming, data structures, and data-science fundamentals.
Coordinated weekly with instructors to design sections, refine assignments, and target common misconceptions.
Built fair, rubric-driven assessments; improved clarity of exam questions and scaled feedback with Gradescope.
Provided inclusive, scaffolded support via flexible office hours and high-touch debugging guidance.

Skills
#

  • Programming: Python, R, Java, C/C++, SQL, Ocaml, Haskell, Prolog, Matlab, WebDev
  • AI/Machine Learning Tools: PyTorch, Tensorflow, Conda, Huggingface
  • Bioinformatics Tools: Seurat, ArchR, Signac, Pegasus, Squidpy, Cellranger, Samtools, Bowtie, David
  • Teaching: Tools Canvas, BlackBoard, Gradescope, Zybooks
  • Languages: Chinese (Mandarin), English, Japanese

Services
#

Journal Reviewer
#

  • BioData Mining
  • BMC Bioinformatics
  • BMC Genomics
  • Discover Computing
  • Discover Genetics and Evolution
  • npj Artificial Intelligence
  • Scientific Reports
  • The Journal of Supercomputing
  • Translational Psychiatry

Conference Reviewer
#

  • 2024 IEEE International Conference on Bioinformatics and Biomedicine
  • 2024-2025 Conference on Information and Knowledge Management
  • 2024-2026 International Conference on Intelligent Systems for Molecular Biology
  • 2026 ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Program Committee Member
#

  • 2024-2025 International Conference on Artificial Intelligence for Medicine, Health, and Care

Awards
#

  • Best Student Paper Award, 2024 Asian Conference on Machine Learning
  • Recipient, NSF Student Travel Award

There are no articles to list here yet.