Publications

denotes co-first authors.

AI Research

LLM Fine-tuning
The Blessing of Dimensionality in LLM Fine-tuning: A Variance-Curvature Perspective
Qiyao Liang, Jinyeop Song, Yizhou Liu, Jeff Gore, Ila R. Fiete, Risto Miikkulainen, Xin Qiu
arXiv preprint (2026)
We analyze why overparameterized LLMs fine-tune so well, revealing that high dimensionality reduces variance and smooths the loss landscape, creating a blessing rather than a curse.
KG RAG
Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning
Jinyeop Song, Song Wang, Julian Shun, Yada Zhu
Under Review (2026)
We propose an RL-trained agent that efficiently navigates knowledge graphs for retrieval-augmented generation, achieving strong transfer across diverse KG structures.
Task Vectors
Jinyeop Song, Seungwook Han, Pulkit Agrawal, Jeff Gore
ICML 2025 Spotlight
We study task-vector formation in in-context learning through an encoder-decoder perspective, linking compact internal representations to in-context predictions.
Bayesian LLM
When Do LLMs Improve Bayesian Optimization? A Systematic Comparison Across Molecular and Protein Design
Mattias Akke, Soojung Yang, Jurģis Ruža, Jinyeop Song, Elton Pan, Rafael Gómez-Bombarelli
NeurIPS 2025 AI4Science Workshop
We systematically benchmark when LLM-guided Bayesian optimization outperforms classical methods across molecular and protein design tasks.
Protein Embeddings
Probing the Embedding Space of Protein Foundation Models through Intrinsic Dimension Analysis
Soojung Yang, Juno Nam, Tynan Perez, Jinyeop Song, Xiaochen Du, Rafael Gómez-Bombarelli
NeurIPS 2025 AIDrugX Workshop
We probe protein foundation model embeddings via intrinsic dimension analysis, revealing how representation geometry relates to downstream task performance.
Empowerment
Jinyeop Song, Jeff Gore, Max Kleiman-Weiner
ICML 2026
We introduce a framework to estimate the empowerment of LLM agents—their capacity to influence the environment—as a step toward safer AI deployment.
SPOT
When AI Co-Scientists Fail: SPOT — Benchmark for Automated Verification of Scientific Research
G. Son, J. Hong, H. Fan, H. Nam, H. Ko, S. Lim, Jinyeop Song, et al.
arXiv preprint (2025)
We present SPOT, a benchmark for evaluating whether AI co-scientist systems can reliably verify claims in scientific papers across multiple domains.
MethylGPT
MethylGPT: a foundation model for the DNA methylome
Kejun Ying, Jinyeop Song, Haotian Cui, et al.
Under Review at Nature Methods (2024)
We develop a GPT-style foundation model pretrained on large-scale human DNA methylation data, enabling zero-shot biological age prediction and disease classification.
Reconciling Scaling Laws
Tim Pearce, Jinyeop Song
TMLR 2024
We resolve the apparent contradiction between Kaplan and Chinchilla neural scaling laws by identifying key methodological differences and proposing a unified framework.
Resource Model
A Resource Model for Neural Scaling Law
Jinyeop Song, Ziming Liu, Max Tegmark, Jeff Gore
ICLR 2024 BGPT Workshop
We draw an analogy between neural scaling and ecological resource competition, offering a mechanistic model that predicts power-law scaling behavior in neural networks.

Biophysics Research

Coalescence
Interspecies Interactions Drive Community-level Selection in Microbial Coalescence
Jinyeop Song, Jiliang Hu, Jeff Gore
Nature Ecology & Evolution (2026)
We show that interspecies interactions, rather than individual species fitness, are the primary driver of community-level outcomes when microbial communities merge.
Microcosm Attractors
Transition from Global Stability to Multiple Attractors in Microcosms
Jeff Gore, Jiliang Hu, You He, Matthieu Barbier, Jinyeop Song, Guy Bunin
Under Review at Nature Portfolio (2025)
We experimentally demonstrate how microbial ecosystems transition from a single global equilibrium to multiple stable states as community complexity increases.
Antibody Catenation
Noncovalent Antibody Catenation on a Target Surface Greatly Increases the Antigen-Binding Avidity
Jinyeop Song, Bo-Seong Jeong, Seong-Woo Kim, et al.
eLife 2023
We discover that antibodies can form noncovalent chain-like structures on antigen surfaces, dramatically enhancing binding avidity through cooperative multivalent interactions.
Bacterial Pathogen ID
Rapid species identification of pathogenic bacteria from a minute quantity exploiting three-dimensional quantitative phase imaging and artificial neural network
Geon Kim, Daewoong Ahn, Minhee Kang, Jinho Park, DongHun Ryu, YoungJu Jo, Jinyeop Song, et al.
Light: Science & Applications 2022
We combine 3D quantitative phase imaging with deep learning to achieve rapid, label-free identification of individual bacterial pathogens at the single-cell level.
CAR-T Immunological Synapses
Deep-Learning Based Three-Dimensional Label-Free Tracking and Analysis of Immunological Synapses of CAR-T Cells
Moosung Lee, Young-Ho Lee, Jinyeop Song, Geon Kim, YoungJu Jo, HyunSeok Min, Chan Hyuk Kim, YongKeun Park
eLife 2020
We develop a deep-learning framework for 3D label-free tracking and quantitative analysis of immunological synapses formed by CAR-T cells in real time.