Experience

Research Scientist, Analytics & AI Methods at Scale (AAIMS) Group
Oak Ridge National Laboratory (ORNL) October 2020 – Present
- Founding member of AAIMS group; technical lead for operational data analytics and AI systems. Lead teams of 3-5 core researchers with coordination across 15-20 cross-functional collaborators.
- SC21 Best Paper for Summit power efficiency analysis across 27,000+ GPUs.
- Built Frontier power monitoring infrastructure processing 100+ GiB/day via Kafka/Spark pipelines, enabling real-time operational insights.
- Core contributor to ExaDigiT digital twin framework (R&D 100 Award), coordinating 25+ institutions for exascale facility modeling.
- Designed LLM-based systems for predictive analytics and operational data queries, achieving a 26% accuracy improvement over baseline.
Research Associate / Postdoctoral Research Associate
Oak Ridge National Laboratory (ORNL) February 2017 – September 2020
- Developed HPC storage middleware optimizations for burst buffer systems, published at IPDPS'19.
- Led NVMe vendor evaluation for Summit/Frontier deployment, testing 5 vendors across 4,600+ node configurations.
- Designed Cooling Intelligence system for Summit, projecting 20% energy savings through predictive thermal management.
- Built foundational telemetry infrastructure and analytics pipelines for ORNL Leadership Computing Facility operations.
Research Assistant
Seoul National University, South Korea September 2010 – February 2017
- Developed cross-layer SSD optimizations, integrating custom FTLs, OS enhancements, and FPGA-based emulation and prototyping.
- Designed high-performance SSD storage architectures for HPC, reducing tail latency in key-value stores.
- Developed a custom key-value storage engine using Samsung SSD garbage collection APIs, improving latency consistency with a 6-9x reduction in 99.9999th-percentile read latency.
Research Engineer | Software Engineer
TmaxSoft June 2006 – September 2010
- Designed and developed a non-intrusive, LD_PRELOAD-based middleware transaction instrumentation framework, enabling end-to-end performance monitoring of enterprise applications. Built transaction latency monitoring modules for products such as BEA Tuxedo, TmaxSoft Tmax, and Oracle using function interception and lock-free shared-memory IPC.
- Led deployment of the application instrumentation layer for the LG Display Zero Failure Project (LG Display Ltd.), delivering a function-interception-based middleware transaction monitoring system for their mission-critical Manufacturing Execution System (MES).
Software Developer
Samsung Networks, South Korea (merged into Samsung SDS) March 2003 – June 2006
- Maintained and enhanced NMSPlus 3.0–3.1, a network monitoring system collecting SNMP, ping, and NetFlow statistics from Cisco, Alcatel, and Juniper devices.
- Developed SNMP-based data collection modules for ATM switches and L4 switches, expanding network monitoring capabilities.

Education

Ph.D., Electrical Engineering and Computer Science (MA & Ph.D. integrated)
Seoul National University September 2010 – February 2017
Dissertation: “OS I/O Stack Optimizations for Flash Solid-State Drives”, supervised by Heonyoung Yeom.
B.Sc., Computer Science
Korea University March 1996 – February 2003
