About me

Hello there!

I am a student interested in AI and Software Development. I am currently studying Computer Science and Engineering at Yonsei University, South Korea.

I have several years of experience in Startups and Software Development. I am currently working as an undergraduate intern at MIRLAB, Yonsei University (advised by Prof. Youngjae Yu) with a focus on Multimodal Learning, Visual Language Models, and Music Generation.

Feel free to reach out to me if you have any questions or if you would like to collaborate on research or project.

Research Interests & Projects

BGMGen (2024-01 Software Capstone Project)

Music Generation / Visual Language Models / Multimodal Learning

We present BGMGen, a novel framework for generating background music that seamlessly aligns with video content. BGMGen leverages multimodal learning models (CLIP/MusicGen) to conditionally create music based on the visual information in videos, specifically focusing on the emotional tone and human actions depicted. Our framework evaluates the congruence of the generated music with the video using the VAD (Valence-Arousal-Dominance) space, a vector space that effectively represents emotional states. By integrating visual cues and emotional context, BGMGen aims to enhance the overall audiovisual experience through precisely tailored background music.

Human Body Reconstruction (during internship in Innoyard)

3D Scanning / Depth Estimation / RGBD Odometry

Conducted research on 3D reconstruction using photos taken from mobile devices and contributed to developing a human body 3D scanner app. Additionally, I created an application for detecting plagiocephaly, designed for both marketing and data collection purposes.

Education

  1. Yonsei University, School of Computing

    Mar. 2019 — Present

    B.S Computer Science and Engineering

  2. HKUST, School of Engineering

    Aug. 2018 — Feb. 2019

    B.E General Engineering, (Attended)

  3. The British School, New Delhi

    Aug. 2014 — Aug. 2018

    Class of 2018, International Baccalaureate Diploma

Experience

  1. Undergraduate Intern, MIRLAB (Multimodal Intelligence Research LAB)

    Apr. 2024 — Present

    Interested in Multimodal Learning, LLM Reasoning, and Music Generation. (advised by Prof. Youngjae Yu)

  2. Software Developer, Innoyard

    Nov. 2020 — Jan. 2023

    InnoScan Contributed to human body 3D scanning using mobile device LIDAR sensors, with a focus on applying open-source depth map estimation methods for more accurate results
    Speedoo Developed web application for infants, which facilitates manual testing to determine whether an infant has Plagiocephaly or Craniosynostosis. (LINK: https://speedoo.app/#homeView)

Extracurricular Activities

  1. Google Developer Group, Yonsei

    Sept. 2023 — Present

    ML/AI Participant (Core)

  2. Student Council, Department of Computer Science

    Jan. 2019 — Dec. 2020

    Promotion Manager

Research

My skills

  • Problem-solving
    100%
  • Writing
  • Development
    100%
  • Design
    100%

Portfolio

Blog

Paper blog

  • Yonsei

    VidMuse

    Blog about VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling