<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Zeyu Zou | UCSC OSPO</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/author/zeyu-zou/</link><atom:link href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/zeyu-zou/index.xml" rel="self" type="application/rss+xml"/><description>Zeyu Zou</description><generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><image><url>https://deploy-preview-1007--ucsc-ospo.netlify.app/author/zeyu-zou/avatar_huc593f21a5158f740d0ae75940ee94474_567636_270x270_fill_q75_lanczos_center.jpg</url><title>Zeyu Zou</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/author/zeyu-zou/</link></image><item><title>Final Report — RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/report/osre25/uci/rag-st/09302025-zeyu/</link><pubDate>Sun, 28 Sep 2025 00:00:00 +0000</pubDate><guid>https://deploy-preview-1007--ucsc-ospo.netlify.app/report/osre25/uci/rag-st/09302025-zeyu/</guid><description>&lt;p>Hello! I’m Zeyu Zou! I have been contributing to the &lt;strong>RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics&lt;/strong> project under the mentorship of Ziheng Duan. My project focuses on developing a framework that predicts spatial gene expression from histology images by combining vision encoders with single-cell RNA-seq references. The goal is to make spatial transcriptomics more affordable, interpretable, and scalable for the research community.&lt;/p>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>RAG-ST is designed to reduce the cost and complexity of spatial transcriptomics by leveraging existing histology images and scRNA-seq priors. This work integrates computer vision with retrieval-augmented generation to improve prediction accuracy and interpretability.&lt;/p>
&lt;h2 id="methods">Methods&lt;/h2>
&lt;p>The project used a two-stage pipeline:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Vision encoder&lt;/strong> (ResNet50/ViT) to map histology patches to cell type distributions.&lt;/li>
&lt;li>&lt;strong>Retrieval-augmented generation&lt;/strong> guided by scRNA-seq profiles to predict gene expression.&lt;/li>
&lt;/ol>
&lt;p>Datasets included &lt;strong>HEST-1K&lt;/strong> (paired histology and expression) and &lt;strong>CellxGene Census&lt;/strong> as the reference database. Training and evaluation pipelines were implemented in PyTorch.&lt;/p>
&lt;h2 id="results">Results&lt;/h2>
&lt;ul>
&lt;li>Implemented a complete pipeline from histology preprocessing to expression prediction.&lt;/li>
&lt;li>Achieved higher correlation scores (Pearson/Spearman) and lower errors (MSE/MAE) compared to baseline models.&lt;/li>
&lt;li>Produced spatial gene expression maps with interpretable retrieval traces and attention weights.&lt;/li>
&lt;li>Released open-source code, preprocessing scripts, and analysis notebooks for reproducibility.&lt;/li>
&lt;/ul>
&lt;h2 id="future-work">Future Work&lt;/h2>
&lt;ul>
&lt;li>Extend experiments to additional tissues (lung, liver, tumor samples).&lt;/li>
&lt;li>Test cross-dataset generalization and robustness.&lt;/li>
&lt;li>Explore integration into clinical pathology workflows for affordable spatial inference.&lt;/li>
&lt;/ul>
&lt;h2 id="acknowledgments">Acknowledgments&lt;/h2>
&lt;p>Thanks to my mentor Ziheng Duan, the UC OSPO team, the HEST-1K dataset contributors, and the CellxGene Census project. This work was conducted under OSRE 2025.&lt;/p>
&lt;h2 id="links">Links&lt;/h2>
&lt;ul>
&lt;li>Repository: &lt;a href="https://github.com/ZeyuZou/rag-st" target="_blank" rel="noopener">https://github.com/ZeyuZou/rag-st&lt;/a>&lt;/li>
&lt;li>Preprint: &lt;em>RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics&lt;/em> (bioRxiv, 2025)&lt;/li>
&lt;/ul></description></item><item><title>RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/report/osre25/uci/rag-st/06192025-zeyu/</link><pubDate>Thu, 19 Jun 2025 00:00:00 +0000</pubDate><guid>https://deploy-preview-1007--ucsc-ospo.netlify.app/report/osre25/uci/rag-st/06192025-zeyu/</guid><description>&lt;p>Hi everyone! My name is Zeyu, and I will be working on a project for a retrieval-enhanced generative framework for spatial transcriptomics during Google Summer of Code 2025. My project is called &lt;a href="https://ucsc-ospo.github.io/project/osre25/uci/rag-st/" target="_blank" rel="noopener">&lt;strong>RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics&lt;/strong>&lt;/a> and is supervised by &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/ziheng-duan/">Ziheng Duan&lt;/a>. The goal is to develop a retrieval-enhanced generative framework for predicting spatial gene expression from histological images, making spatial transcriptomics more affordable and easier to implement. &lt;a href="https://drive.google.com/file/d/1_yUf1NlVRpBXERCqnOby7pgP4WrWrZsr/view?usp=sharing" target="_blank" rel="noopener">You can view my full proposal here!&lt;/a>&lt;/p>
&lt;p>Spatial transcriptomics enables the capture of gene expression profiles with spatial resolution, providing unprecedented insights into cellular organization and the tissue microenvironment. However, its widespread application is limited by high costs and technical complexity. In contrast, histological imaging is inexpensive and widely accessible. If we can accurately predict gene expression from histology images, then high-resolution spatial information can be inferred without costly experiments.&lt;/p>
&lt;p>My project will:&lt;/p>
&lt;ul>
&lt;li>Create a large-scale paired dataset combining HEST histology images with reference gene expression profiles from CellxGene.&lt;/li>
&lt;li>Design a novel RAG-ST architecture that enables both &lt;strong>interpretable&lt;/strong> and &lt;strong>controllable&lt;/strong> generation of spatial gene expression.&lt;/li>
&lt;li>Benchmark RAG-ST against current state-of-the-art models for image-based gene expression inference.&lt;/li>
&lt;li>Open-source the full codebase and provide comprehensive tutorials to support future research and development.&lt;/li>
&lt;/ul>
&lt;p>I am excited to contribute to this project and help broaden access to spatial transcriptomics insights through machine learning–powered predictions!&lt;/p>
&lt;p>Zeyu Zou&lt;/p>
&lt;p>University of Northeastern Graduate&lt;/p>
&lt;p>Zeyu Zou is a graduate student at the University of Northeastern, where he is majoring in Analytics.&lt;/p></description></item></channel></rss>