<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Joaquin Chung | UCSC OSPO</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/</link><atom:link href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/index.xml" rel="self" type="application/rss+xml"/><description>Joaquin Chung</description><generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><image><url>https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/avatar_hu7ee6a1b8f3281a9341e8e47267961d0b_21973_270x270_fill_q75_lanczos_center.jpg</url><title>Joaquin Chung</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/</link></image><item><title>Architecting the Future of Scientific Data: Multi-Site Streaming Without Compromise</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/project/osre25/anl/scistream/</link><pubDate>Mon, 10 Feb 2025 00:00:00 +0000</pubDate><guid>https://deploy-preview-1007--ucsc-ospo.netlify.app/project/osre25/anl/scistream/</guid><description>&lt;p>Data is generated at ever-increasing rates, yet it’s often processed more slowly than it’s collected. Scientific instruments frequently operate below their full capacity or discard
valuable data due to network bottlenecks, security domain mismatches, and insufficient real-time processing capabilities.&lt;/p>
&lt;p>&lt;a href="https://github.com/scistream/scistream-proto" target="_blank" rel="noopener">SciStream&lt;/a> reimagines how scientific data moves across modern research infrastructure by providing a framework for high-speed (+100Gbps)
memory-to-memory streaming that doesn’t compromise on security. Whether connecting scientific instruments to analysis clusters or bridging across institutional boundaries, SciStream provides the foundation for next-generation scientific
workflows.&lt;/p>
&lt;p>Building on our &lt;a href="https://dl.acm.org/doi/abs/10.1145/3502181.3531475" target="_blank" rel="noopener">published research&lt;/a>, we’re now expanding the framework’s capabilities through open-source development and community
collaboration. These projects offer an opportunity for
students to gain hands-on experience with cutting-edge networking and security technologies used in high-performance computing (HPC), cloud infrastructure, and large-scale scientific
experiments.&lt;/p>
&lt;h3 id="scistream-securebench-a-framework-for-benchmarking-security-protocols-in-scientific-data-streaming">SciStream-SecureBench: A Framework for Benchmarking Security Protocols in Scientific Data Streaming&lt;/h3>
&lt;p>&lt;strong>Project Idea Description:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Topics&lt;/strong>: Security Protocols, Network Performance, Data Streaming, Reproducibility, High-throughput Computing&lt;/li>
&lt;li>&lt;strong>Skills&lt;/strong>: Python, Scripting, Linux, Network Protocol Analysis, Containers, Benchmarking tools&lt;/li>
&lt;li>&lt;strong>Difficulty&lt;/strong>: Medium&lt;/li>
&lt;li>&lt;strong>Size&lt;/strong>: Large (350) hours&lt;/li>
&lt;li>&lt;strong>Mentors:&lt;/strong> &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/">Joaquin Chung&lt;/a>, &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/flavio-castro/">Flavio Castro&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Ever wondered why large scientific experiments need to move massive amounts of data securely and quickly? While TLS and SSH are standard for secure data transfer,
there’s a surprising lack of benchmarks that evaluate their performance in high-speed scientific workflows. This project aims to fill this gap by developing a benchmarking suite that
measures how different security configurations impact real-time scientific data streaming.&lt;/p>
&lt;h3 id="specific-tasks-of-the-project-include">&lt;strong>Specific Tasks of the Project Include&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Developing benchmarking tools that measure key security performance metrics like handshake latency, throughput stability, and computational overhead.&lt;/li>
&lt;li>Running &lt;strong>real-world experiments&lt;/strong> on research testbeds (Chameleon, FABRIC) to simulate scientific data patterns.&lt;/li>
&lt;li>Automating comparative analysis between TLS and SSH, with focus on streaming-specific metrics like &lt;strong>time-to-first-byte and sustained throughput&lt;/strong>.&lt;/li>
&lt;li>Documenting best practices for security protocol selection in high-performance streaming.&lt;/li>
&lt;/ul>
&lt;h3 id="why-this-matters-for-your-career">&lt;strong>Why This Matters for Your Career&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Gain expertise in &lt;strong>network security and performance analysis&lt;/strong>, highly valued in cybersecurity, cloud computing, and HPC.&lt;/li>
&lt;li>Work on a &lt;strong>real research challenge&lt;/strong> with potential for publication.&lt;/li>
&lt;/ul>
&lt;h3 id="scistream-streambench-comparative-analysis-of-scientific-streaming-frameworks">SciStream-StreamBench: Comparative Analysis of Scientific Streaming Frameworks&lt;/h3>
&lt;p>&lt;strong>Project Idea Description:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Topics&lt;/strong>: Data Streaming Protocols, Network Performance, Benchmarking, Distributed Systems, Real-time Computing&lt;/li>
&lt;li>&lt;strong>Skills&lt;/strong>: Python, ZeroMQ, EPICS/PVAccess, Linux, Performance Analysis, Visualization&lt;/li>
&lt;li>&lt;strong>Difficulty&lt;/strong>: Medium&lt;/li>
&lt;li>&lt;strong>Size&lt;/strong>: Large (350) hours&lt;/li>
&lt;li>&lt;strong>Mentors:&lt;/strong> &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/">Joaquin Chung&lt;/a>, &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/flavio-castro/">Flavio Castro&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Scientific experiments generate enormous amounts of streaming data, but how do we choose the best framework for handling it efficiently? Despite the widespread use of ZeroMQ and
&lt;a href="https://dl.acm.org/doi/10.1145/3624062.3624610" target="_blank" rel="noopener">PVApy&lt;/a>,
there’s little systematic benchmarking comparing their performance. This project will develop &lt;strong>real-world benchmarks&lt;/strong> to evaluate how different frameworks handle scientific data in
&lt;strong>high-speed environments&lt;/strong>.&lt;/p>
&lt;h3 id="the-specific-tasks-of-the-project-include">&lt;strong>The Specific Tasks of the Project Include&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Designing benchmarking methodologies to assess key performance metrics like &lt;strong>synchronization overhead, time-to-first-data, and throughput stability&lt;/strong>.&lt;/li>
&lt;li>Developing a test harness that simulates real-world streaming conditions (network variability, concurrent streams, dynamic data rates).&lt;/li>
&lt;li>Running experiments on &lt;strong>Chameleon and FABRIC testbeds&lt;/strong>.&lt;/li>
&lt;li>Automating data collection and visualization to highlight performance trends.&lt;/li>
&lt;li>Documenting best practices and framework-specific optimizations.&lt;/li>
&lt;/ul>
&lt;h3 id="why-this-matters-for-your-career-1">&lt;strong>Why This Matters for Your Career&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Get hands-on experience with &lt;strong>real-time data processing&lt;/strong> and &lt;strong>network performance analysis&lt;/strong>.&lt;/li>
&lt;li>Learn benchmarking techniques useful for &lt;strong>distributed systems, cloud computing, and high-performance networking&lt;/strong>.&lt;/li>
&lt;/ul>
&lt;h3 id="scistream-quic-next-generation-proxy-architecture-for-scientific-data-streaming">SciStream-QUIC: Next-Generation Proxy Architecture for Scientific Data Streaming&lt;/h3>
&lt;p>&lt;strong>Project Idea Description:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Topics&lt;/strong>: QUIC Protocol, Network Proxies, Performance Analysis, Protocol Design, Hardware Acceleration&lt;/li>
&lt;li>&lt;strong>Skills&lt;/strong>: Python/C++, Network Programming, QUIC (quiche/aioquic), Linux, Performance Analysis&lt;/li>
&lt;li>&lt;strong>Difficulty&lt;/strong>: Hard&lt;/li>
&lt;li>&lt;strong>Size&lt;/strong>: Large (350) hours&lt;/li>
&lt;li>&lt;strong>Mentors:&lt;/strong> &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/">Joaquin Chung&lt;/a>, &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/flavio-castro/">Flavio Castro&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Ever wondered how YouTube loads videos faster than traditional web pages? That’s because of &lt;strong>QUIC&lt;/strong>, a next-generation protocol designed for speed and security. Initial evaluations
of federated streaming architectures (&lt;a href="https://par.nsf.gov/servlets/purl/10380551" target="_blank" rel="noopener">INDIS'22
paper&lt;/a>) suggest potential benefits of QUIC, but comprehensive benchmarking is
needed. This project explores whether &lt;strong>QUIC-based proxies&lt;/strong> can outperform traditional &lt;strong>TCP+TLS&lt;/strong> proxies for scientific data streaming, potentially revolutionizing how researchers move
large datasets.&lt;/p>
&lt;h3 id="the-specific-tasks-of-the-project-include-1">&lt;strong>The Specific Tasks of the Project Include&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Developing a &lt;strong>QUIC-based proxy&lt;/strong> optimized for scientific workflows.&lt;/li>
&lt;li>Running benchmarks to compare &lt;strong>QUIC vs. traditional TLS proxies&lt;/strong>.&lt;/li>
&lt;li>Investigating &lt;strong>hardware encryption offloading&lt;/strong> for QUIC and TLS.&lt;/li>
&lt;li>Designing &lt;strong>reproducible experiments&lt;/strong> using Chameleon and FABRIC testbeds.&lt;/li>
&lt;li>Documenting best practices for deploying &lt;strong>QUIC proxies in HPC environments&lt;/strong>.&lt;/li>
&lt;/ul>
&lt;h3 id="why-this-matters-for-your-career-2">&lt;strong>Why This Matters for Your Career&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Gain experience in &lt;strong>cutting-edge networking protocols&lt;/strong> used in cloud computing (Google, Cloudflare, etc.).&lt;/li>
&lt;li>Learn about &lt;strong>hardware acceleration&lt;/strong> and its role in high-speed networking.&lt;/li>
&lt;/ul>
&lt;h3 id="scistream-auth-modern-authentication-and-user-interface-for-scientific-data-streaming">SciStream-Auth: Modern Authentication and User Interface for Scientific Data Streaming&lt;/h3>
&lt;p>&lt;strong>Project Idea Description:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Topics&lt;/strong>: Authentication Systems, UI/UX Design, Security Integration, Scientific Computing&lt;/li>
&lt;li>&lt;strong>Skills&lt;/strong>: Python, Web Development (React/Vue), OAuth 2.0/SAML, Security Analysis&lt;/li>
&lt;li>&lt;strong>Difficulty&lt;/strong>: Medium&lt;/li>
&lt;li>&lt;strong>Size&lt;/strong>: Large (350) hours&lt;/li>
&lt;li>&lt;strong>Mentors:&lt;/strong> &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/">Joaquin Chung&lt;/a>, &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/flavio-castro/">Flavio Castro&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Not a security expert? You can still contribute by designing an interactive front-end!&lt;/p>
&lt;p>In today&amp;rsquo;s scientific computing landscape, authentication and user experience often act as barriers to adoption rather than enabling seamless collaboration. While SciStream excels at
high-speed data transfer, its reliance on a single authentication provider and command-line interface limits its accessibility. This project aims to transform SciStream into a more
versatile platform by implementing a modular authentication system and developing an intuitive graphical interface.&lt;/p>
&lt;p>By expanding beyond Globus Auth to support multiple authentication frameworks, we can enable broader adoption across different scientific communities while maintaining robust security.
Coupled with a modern GUI that visualizes real-time streaming activity, this enhancement will make SciStream more accessible to researchers—allowing them to focus on their science rather
than wrestling with complex configurations.&lt;/p>
&lt;p>This project will design a user-friendly interface that makes secure scientific data streaming as intuitive as using a cloud storage service. You&amp;rsquo;ll also gain hands-on experience with
authentication methods used by industry leaders like Google and Facebook, while directly improving access to scientific data.&lt;/p>
&lt;h3 id="the-specific-tasks-of-the-project-include-2">&lt;strong>The Specific Tasks of the Project Include&lt;/strong>&lt;/h3>
&lt;ul>
&lt;li>Design and implementation of a pluggable authentication system supporting multiple providers (OAuth 2.0, SAML, OpenID Connect, certificate-based auth)&lt;/li>
&lt;li>Development of a modern, responsive GUI using web technologies that provides real-time visualization of system status&lt;/li>
&lt;li>Creation of comprehensive security testing protocols to validate the authentication implementations&lt;/li>
&lt;li>Implementation of session management and secure credential handling within the GUI&lt;/li>
&lt;li>Design of an intuitive interface for managing streaming configurations and monitoring data flows&lt;/li>
&lt;li>Creation of documentation and examples to help facilities integrate their preferred authentication mechanisms&lt;/li>
&lt;/ul></description></item><item><title>SciStream-Rep: An Artifact for Reproducible Benchmarks of Scientific Streaming Applications</title><link>https://deploy-preview-1007--ucsc-ospo.netlify.app/project/osre24/anl/scistream/</link><pubDate>Mon, 26 Feb 2024 00:00:00 +0000</pubDate><guid>https://deploy-preview-1007--ucsc-ospo.netlify.app/project/osre24/anl/scistream/</guid><description>&lt;p>&lt;a href="https://github.com/scistream/scistream-proto" target="_blank" rel="noopener">SciStream&lt;/a> is a framework and toolkit that attempts to tackle the problem of enabling high-speed(+100Gbps), memory-to-memory data streaming in scientific environments. This task is particularly challenging because data producers (e.g., data acquisition applications on scientific instruments, simulations on supercomputers) and consumers (e.g., data analysis applications) may be in different security domains and thus require bridging of those domains. Furthermore, either producers, consumers, or both may lack external network connectivity and thus require traffic forwarding proxies. If you want to learn more, please take a look at our &lt;a href="https://dl.acm.org/doi/abs/10.1145/3502181.3531475" target="_blank" rel="noopener">HPDC'22 paper&lt;/a>.&lt;/p>
&lt;h3 id="scistream-rep-an-artifact-for-reproducible-benchmarks-of-scientific-streaming-applications">SciStream-Rep: An Artifact for Reproducible Benchmarks of Scientific Streaming Applications&lt;/h3>
&lt;p>&lt;strong>Project Idea Description:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Topics&lt;/strong>: Network Performance Testing, Benchmarking, Data Streaming, Reproducibility&lt;/li>
&lt;li>&lt;strong>Skills&lt;/strong>: Python, Scripting, Linux, Containers, Networking, benchmark tools&lt;/li>
&lt;li>&lt;strong>Difficulty&lt;/strong>: Medium&lt;/li>
&lt;li>&lt;strong>Size&lt;/strong>: Large (350) hours&lt;/li>
&lt;li>&lt;strong>Mentors:&lt;/strong> &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/">Joaquin Chung&lt;/a>, &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/flavio-castro/">Flavio Castro&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>This project focuses on expanding the scope of testing SciStream’s architecture by incorporating a variety of traffic patterns based on real scientific applications. The goal is to understand how different traffic patterns influence the performance of memory-to-memory data streaming in scientific scenarios by creating artifacts for reproducible experiments. Additionally, the project will explore the use of different forwarding elements, such as Nginx and HAProxy, to assess their impact on data streaming efficiency and security.&lt;/p>
&lt;p>Reproducibility is especially difficult in shared network environments such as Chameleon and FABRIC testbeds. We can expect similar results for two exact same experiments, only when the network condition (external to our traffic) is similar for both experiments. By creating reproducible artifacts for Chameleon and FABRIC, we can build statistical confidence in the measured results by multiple repetitions from other researchers.&lt;/p>
&lt;p>The Specific Tasks of the Project Include:&lt;/p>
&lt;ul>
&lt;li>Developing a set of benchmarks to measure the performance of scientific streaming applications across a broader range of traffic patterns.&lt;/li>
&lt;li>Creating a set of artifacts for generating traffic patterns typical of data streaming applications.&lt;/li>
&lt;li>Deploying various forwarding elements within the SciStream architecture for the Chameleon and FABRIC testbeds.&lt;/li>
&lt;li>Compiling a best practices document detailing the optimal configurations for Scistream.&lt;/li>
&lt;/ul>
&lt;h3 id="scistream-lb-a-dynamic-load-balancing-solution-using-programmable-network-devices">Scistream-LB: A Dynamic Load Balancing Solution Using Programmable network devices&lt;/h3>
&lt;p>&lt;strong>Project Idea Description:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Topics&lt;/strong>: Network Performance Testing, Data Streaming, Reproducibility, Programmable Data Planes&lt;/li>
&lt;li>&lt;strong>Skills&lt;/strong>: Python/Scripting, Linux, Docker/Containers, Networking fundamentals, Experience with OpenFlow/P4 programming&lt;/li>
&lt;li>&lt;strong>Difficulty&lt;/strong>: Difficult&lt;/li>
&lt;li>&lt;strong>Size&lt;/strong>: Large (350) hours&lt;/li>
&lt;li>&lt;strong>Mentors:&lt;/strong> &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/joaquin-chung/">Joaquin Chung&lt;/a>, &lt;a href="https://deploy-preview-1007--ucsc-ospo.netlify.app/author/flavio-castro/">Flavio Castro&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>The aim of this project is to create a specialized forwarding element using OpenFlow (OF) or P4 programming languages, tailored to enhance the SciStream data plane. This new development seeks to enable a more flexible and hardware-based (and therefore more efficient) alternative to conventional software-based forwarding mechanisms like NGINX or HAProxy, specifically designed to support the needs of high-performance data streaming environments for scientific applications. The OF/P4 forwarding elements will be packaged as artifacts for reproducibility experiments in Chameleon and FABRIC testbeds. Reproducibility is especially difficult in shared network environments such as Chameleon and FABRIC testbeds. We can expect similar results for two exact same experiments, only when the network condition (external to our traffic) is similar for both experiments. By creating reproducible artifacts for Chameleon and FABRIC, we can build statistical confidence in the measured results by multiple repetitions from other researchers.&lt;/p>
&lt;p>Specific tasks of the project include:&lt;/p>
&lt;ul>
&lt;li>Design and implementation of an OF/P4-based forwarding element that can be seamlessly integrated with the data plane of SciStream’s architecture.&lt;/li>
&lt;li>Forwarding logic that supports efficient and secure memory-to-memory data streaming.&lt;/li>
&lt;li>A set of benchmarks for evaluating the new forwarding element against traditional options, focusing on improvements in throughput, latency, and security.&lt;/li>
&lt;li>An investigation on the potential advantages of programmable network elements for detailed control over data streaming paths and security configurations.&lt;/li>
&lt;li>A package of the newly developed forwarding elements as artifacts for reproducibility experiments in Chameleon and FABRIC testbeds.&lt;/li>
&lt;/ul></description></item></channel></rss>