Experience
Kakao Mobility (software engineer) Seoul, South Korea
Software Engineer Dec 2021 – Sep 2022
- Led a 3-person team in developing an app for 3D maps as a part of the Autonomous Driving pointcloud data pipeline.
- Designed an API protocol of automation features in collaboration with the AI team to integrate ML into the service.
- Structured and managed the Google GKE namespace to ensure compatibility with local Kubernetes, facilitating seamless development.
Stryx Seoul, South Korea
Software Engineer Mar 2020 – Dec 2021
- Reduced build time by 70%, simplified dependency management by merging multiple repositories into a mono-repository.
- Achieved OS independence by refactoring a C++ 3D algorithm module to Rust, using Node.js NAPI binding.
Stryx Seoul, South Korea
Software Engineer Intern Nov 2019 – Feb 2020
- Enhanced internal server efficiency by cutting bandwidth and TTFB by 80%, introducing multi-layer Redis caching.
- Elevated team productivity through downsizing the Docker image from 2GB to 180MB, utilizing multi-stage builds in CI.
Academic Paper
RTSUM, Yonsei University Data & Language Intelligence Lab Mar 2023 – Aug 2023
Research Intern
Cho, S., Jang, M., Yeo, J., & Lee, D. (2023). RTSUM: Relation Triple-based Interpretable Summarization with Multi-level Salience Visualization. In Proceedings of the NAACL 2024 System Demonstrations Track. Association for Computational Linguistics. https://arxiv.org/abs/2310.13895
- Published as the first author, designed Knowledge Graph (KG)-based experiment for validating Interpretable AI framework.
- Increased OpenIE5’s NLP triple extraction speed by 300% by using a reverse proxy and Docker container replicas.
Contest
GenAI Competition at Yonsei University in 2023 Seoul, South Korea
- Won the Grand Prize in collaboration as a part of a team of two for the project MBTI-GPT. Dec 2023
Projects
TPCxAI, Yonsei University Mar 2024 – May 2024
Team Leader
- Implemented a Big Data system for the AI pipeline from scratch on AWS using Hadoop and Spark cluster, employing HDFS.
- Migrated the manual EC2 cluster to Amazon EMR, reducing management costs by simplifying system dependencies.
ReSRer, Yonsei University Data & Language Intelligence Lab Sep 2023 – Jan 2024
Research Intern
- Improved ODQA(Open-Domain Question Answering) performance by 20% with zero-shot LLM context manipulation.
- Enabled large-scale QA benchmarks by indexing 21M Wikipedia passages into a Milvus vector database in 12 hours.
- Boosted LLM evaluation by 40% by introducing a multi-GPU local inference server, Huggingface TGI with asynchronous batch processing.
MBTI GPT, AI Service with 1000+ users Oct 2023 – Feb 2024
Team Leader
- Implemented enterprise-level application of AI personality analyzer using Redis, OpenAI API, Node.js and Faiss.
- Ensured the security of RAG by the approach of real-time vector indexing with dynamic data splitting in-memory.
- Reduced OpenAI API costs by 30% by prompt optimization, utilizing code from ‘LLM as optimizers’ paper.
Yokhal Mar 2024 – Apr 2024
- Accelerated LLM training speed using Pytorch’s multi-node distributed training DDP and FSDP with 4 x RTX3090.
- Implemented a user-centric Chatbot by fine-tuning Gemma on Korean chat and wiki datasets on PEFT QLoRa.
LLaMa2GPTQ Jun 2023 – July 2023
- Designed and developed a local assistant AI application utilizing LangChain, ChromaDB and Streamlit.
- Optimized computing and memory cost by 75% using 4-bit GPTQ quantization applied to the LLaMa2 model.
Texonom Nov 2021 – Jun 2023
- Integrated a Recommender system into the service, devised to build a context extended model through ONNX.
- Developed vector search API for RAG by embedding whole 30,000 pages in service into Postgres pgVector database.
To smooth Jun 221 – July 2021
- Extended Chaikin’s Smooth Algorithm implementation to a multi-dimensional library and deployed it to the NPM registry.
HDGen, Kakao Mobility Oct 2020 – Sep 2022
Leader of Project Development
- Improved vector mapping speed by developing a method to extract normal vectors from pointcloud data.
- Rendered millions of 2D/3D vector data and pointcloud points within the app manipulating Geoserver and PostGiS.
MMS-TWR, Stryx Jun 2020 – Dec 2020
Member of Team
- Upgraded internal service reliability by migrating from bare metal to OpenStack by building Docker image for applications.
MMS-VHCL, Stryx Jun 2020 – Oct 2020
Member of Team
- Obtained LiDAR sensor control for vehicles with real-time pointcloud communication from hardware.
STPano, Stryx Mar 2020 – Jun 2020
Member of Team
- Raised the precision of drone coordinate positioning by developing a mathematic algorithm for panoramic images.
Education
YONSEI UNIVERSITY Seoul, South Korea
Candidate for BSc in Computer Science focused on AI Mar 2017 – Expected June 2024
- Expected GPA: 3.68/4.5 (92.1%), Jinri Scholarship (1 semester of 2018)
- Key Modules: Reinforcement Learning, NLP, Big Data, ML, Computer Vision, Computer Security, Signal Processing, Algorithm for AI, Automata, Computer Architecture, OS, Linear Algebra, Statistics, Discrete Math, OOP, Data Structure
UNIVERSITY OF CALIFORNIA, RIVERSIDE Riverside, CA, United States
Exchange Student Sep 2022 – Jan 2023
- Relevant courses: Capstone Design, Computer Graphics, Compiler Design
Activities and Leadership
Yonsei University Seoul, South Korea
Student Representative Sep 2017 – Mar 2018
- Organized events such as retreats and coordinated several cooperative purchases for the department’s shared uniforms.
University of California, Riverside Riverside, CA, United States
Exchange Student Sep 2022 – Jan 2023
- Extracurricular activities: Member of the Tennis Club, visited CES 2023, and observed a Falcon 9 launch event in Florida.
Jaramter Kindergarten Masan, South Korea
Volunteer Jun 2010 – Nov 2016
- Conducted activities with disabled children during each vacation throughout middle and high school.
Skills and Interest
- Academic Interests: Explainable AI, AI Question Answering, Transformers, LLM, Mechanistical Interpretability
- Technical Writings
- Cho, S. (2024, April 10). Superposition Hypothesis for Steering LLM with Sparse AutoEncoder. Medium. https://seongland.medium.com/superposition-hypothesis-for-steering-llm-with-sparse-autoencoder-c07b74d23e96
- Cho, S. (2024, April 22). Reversing Transformer to Understand In-Context Learning with Phase Change & Feature Dimensionality. Medium. https://seongland.medium.com/reversing-transformer-to-understand-in-context-learning-with-phase-change-feature-dimensionality-13cbf8a2f984
- Programming Languages: Python, Rust, Typescript, C++, JavaScript, Bash, Java
- Skills: Pytorch, PEFT, GCP, FSDP, Model Training, Prompt Engineering, RAG, Vector Database, Hadoop, Faiss, AWS PostgreSQL, Redis, Cloud Computing, Kubernetes, Docker, ONNX, Git, CI/CD, Distributed Systems, ETL, Spark
- Languages: English (Fluent), Korean (Native), Japanese (Intermediate)