Publications

LLM Fine-tuning

The Blessing of Dimensionality in LLM Fine-tuning: A Variance-Curvature Perspective

Qiyao Liang, Jinyeop Song, Yizhou Liu, Jeff Gore, Ila R. Fiete, Risto Miikkulainen, Xin Qiu

arXiv preprint (2026)

We analyze why overparameterized LLMs fine-tune so well, revealing that high dimensionality reduces variance and smooths the loss landscape, creating a blessing rather than a curse.

arXiv

Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning

Jinyeop Song, Song Wang, Julian Shun, Yada Zhu

Under Review (2026)

We propose an RL-trained agent that efficiently navigates knowledge graphs for retrieval-augmented generation, achieving strong transfer across diverse KG structures.

PDF

Task Vectors

Emergence and Effectiveness of Task Vectors in In-Context Learning: An Encoder Decoder Perspective

Jinyeop Song^†, Seungwook Han^†, Pulkit Agrawal, Jeff Gore

ICML 2025 Spotlight

We study task-vector formation in in-context learning through an encoder-decoder perspective, linking compact internal representations to in-context predictions.

arXiv

When Do LLMs Improve Bayesian Optimization? A Systematic Comparison Across Molecular and Protein Design

Mattias Akke, Soojung Yang, Jurģis Ruža, Jinyeop Song, Elton Pan, Rafael Gómez-Bombarelli

NeurIPS 2025 AI4Science Workshop

We systematically benchmark when LLM-guided Bayesian optimization outperforms classical methods across molecular and protein design tasks.

Protein Embeddings

Probing the Embedding Space of Protein Foundation Models through Intrinsic Dimension Analysis

Soojung Yang, Juno Nam, Tynan Perez, Jinyeop Song, Xiaochen Du, Rafael Gómez-Bombarelli

NeurIPS 2025 AIDrugX Workshop

We probe protein foundation model embeddings via intrinsic dimension analysis, revealing how representation geometry relates to downstream task performance.

Estimating the Empowerment of Language Model Agents

Jinyeop Song, Jeff Gore, Max Kleiman-Weiner

ICML 2026

We introduce a framework to estimate the empowerment of LLM agents—their capacity to influence the environment—as a step toward safer AI deployment.

arXiv

When AI Co-Scientists Fail: SPOT — Benchmark for Automated Verification of Scientific Research

G. Son, J. Hong, H. Fan, H. Nam, H. Ko, S. Lim, Jinyeop Song, et al.

arXiv preprint (2025)

We present SPOT, a benchmark for evaluating whether AI co-scientist systems can reliably verify claims in scientific papers across multiple domains.

arXiv

MethylGPT: a foundation model for the DNA methylome

Kejun Ying^†, Jinyeop Song^†, Haotian Cui^†, et al.

Under Review at Nature Methods (2024)

We develop a GPT-style foundation model pretrained on large-scale human DNA methylation data, enabling zero-shot biological age prediction and disease classification.

bioRxiv

Reconciling Kaplan and Chinchilla Scaling Laws

Tim Pearce, Jinyeop Song

TMLR 2024

We resolve the apparent contradiction between Kaplan and Chinchilla neural scaling laws by identifying key methodological differences and proposing a unified framework.

arXiv

A Resource Model for Neural Scaling Law

Jinyeop Song^†, Ziming Liu^†, Max Tegmark, Jeff Gore

ICLR 2024 BGPT Workshop

We draw an analogy between neural scaling and ecological resource competition, offering a mechanistic model that predicts power-law scaling behavior in neural networks.

arXiv

Interspecies Interactions Drive Community-level Selection in Microbial Coalescence

Jinyeop Song, Jiliang Hu, Jeff Gore

Nature Ecology & Evolution (2026)

We show that interspecies interactions, rather than individual species fitness, are the primary driver of community-level outcomes when microbial communities merge.

bioRxiv

Transition from Global Stability to Multiple Attractors in Microcosms

Jeff Gore, Jiliang Hu, You He, Matthieu Barbier, Jinyeop Song, Guy Bunin

Under Review at Nature Portfolio (2025)

We experimentally demonstrate how microbial ecosystems transition from a single global equilibrium to multiple stable states as community complexity increases.

Research Square

Noncovalent Antibody Catenation on a Target Surface Greatly Increases the Antigen-Binding Avidity

Jinyeop Song^†, Bo-Seong Jeong^†, Seong-Woo Kim, et al.

eLife 2023

We discover that antibodies can form noncovalent chain-like structures on antigen surfaces, dramatically enhancing binding avidity through cooperative multivalent interactions.

DOI

Rapid species identification of pathogenic bacteria from a minute quantity exploiting three-dimensional quantitative phase imaging and artificial neural network

Geon Kim, Daewoong Ahn, Minhee Kang, Jinho Park, DongHun Ryu, YoungJu Jo, Jinyeop Song, et al.

Light: Science & Applications 2022

We combine 3D quantitative phase imaging with deep learning to achieve rapid, label-free identification of individual bacterial pathogens at the single-cell level.

DOI

Deep-Learning Based Three-Dimensional Label-Free Tracking and Analysis of Immunological Synapses of CAR-T Cells

Moosung Lee^†, Young-Ho Lee^†, Jinyeop Song^†, Geon Kim, YoungJu Jo, HyunSeok Min, Chan Hyuk Kim, YongKeun Park

eLife 2020

We develop a deep-learning framework for 3D label-free tracking and quantitative analysis of immunological synapses formed by CAR-T cells in real time.

DOI

AI Research

Biophysics Research