About Me

Hi! I’m Shou-Yi, Hung, and you can also call me Ray. I was born in Taichung, Taiwan, but grew up in Xiamen, Fujian, China. I can speak English, Chinese, a little bit of Taiwanese Hokkien, and a little bit of Japanese!

Education

I’m studying at the University of Toronto, majoring in Computer Science with a minor in Statistics.

Some of the important courses that I have taken includes:

Operating Systems, Computer Networking Systems, Deep Learning, Probabilistic Learning, Web Programming, Software Design, Software Engineering, Computer Vision, Data Structures & Algorithms, Algorithms Design & Analysis, Systems Programming, Computer Organization, Relational Databases, Linear Programming

Interests

My research interest currently lies in large language models, machine learning, and data processing. I am interested in the different applications that machine learning can be applied on to make our lives easier from day to day. Furthermore, I am interested to also discover any more optimal way that we can be using large language models, ranging from more efficient training to model interpretability.

My other interest lies in software engineering, which is reflected by my different projects that I have accomplished througout the years. I am extremely excited when I have the opportunity to build something that really makes an impact for myself and also the community around me. More details about this can be found in my portfolio page

Internship Experiences

I’m currently in my co-op internship year, working at Huawei Canada as a Machine Learning Researcher Intern at the Noah’s Ark Lab. Currently, I am focusing on Large Language Model researches, including model quantization and synthetic data generation for model training. Most of the times, I am working on research related works, that is, running experiments, reading papers, presenting ideas, etc. However, sometimes, I also get assigned deliverable works, which involves training models that can be deployed either on the cloud or edge devices. Recently, I am also working on Speculative Decoding with Sequoia for an upcoming project.

Academic Research Experiences

I have had the pleasure to be supervised by Prof. En-Shiun Annie Lee as an NLP research student, where I was given numerous tasks, ranging from conducting statistical analysis to assisting in writing up research papers.

Together, we have published a paper related to computer science education at ACM SIGCSE, and has scheduled to publish another paper concerning the low-resource-language of Taiwanese Hokkien at NAACL 2025, more details can be found under the publications page.

Aside from my NLP research experience, I’ve also had experience working as an operating systems research volunteer under the supervision of Prof. Kuei (Jack) Sun, where I signed up as a volunteer to develop features for a toy operating system (like Stanford’s PintOS) written in Rust, named KidneyOS. In the end, I completed parts of the Pool Allocator for memory management, and a little bit of ELF Header Verifications.

Teaching Experiences

I have also worked previously at the University of Toronto as a Teaching Assistant, for an introductory mathematics course (PUMP Self-Guided) for incoming first year students.

The course was focused on simple algebra and pre-calculus materials. For this course, my job focused on preparing office hours and answering inquiries from students.

Skills

Technologies

  • Python (NumPy, OpenCV, PyTorch, HuggingFace, Accelerate, Distributed Learning)
  • Java (JUnit)
  • SQL (MySQL, PostgreSQL, SQLite)
  • NoSQL (Firebase)
  • HTML/CSS
  • JavaScript
  • C/C++
  • R
  • MIPS Assembly
  • Rust

Machine Learning

  • HuggingFace
  • PyTorch
  • OpenCV
  • NumPy
  • Accelerate (Fully Sharded Data Parallel, Distributed Data Parallel)
  • Distributed Learning

Frameworks / Tools

  • Shell (Bash, Zsh, Sh)
  • Linux / Unix
  • Selenium
  • Matplotlib
  • Seaborn
  • Pandas
  • REST API
  • LaTeX
  • Flask
  • BeautifulSoup
  • React
  • Node.js
  • Docker
  • Jupyter Notebook
  • Conda
  • Git
  • CI/CD
  • Amazon Web Services (EC2)
  • Microsoft Azure