Yejin Hwang

Yejin Hwang

Data Analyst
Data Scientist
View My Work Contact Me
scroll
About Me

Hi, I'm Yejin 👋

Yejin at Samsung Medical Center
🔬 Samsung Medical Center
Samsung Medical Center · Seoul, 2022–23

I believe data has a story — and my job is to find it, tell it, and make it matter. That belief didn't start in a lecture hall. It started in a hospital corridor in Seoul, where I met cancer patients face-to-face to collect clinical data. Looking at the numbers later, I realized each row in a spreadsheet was a person — their pain, their hope, their story. That moment changed everything for me.

2018
Sungkyunkwan University (SKKU)
B.S. Culture & Technology + Food Science & Biotechnology · started at the intersection of media, music & tech
2020
First Step into Data Science
COVID sparked a deep curiosity about human health & disease — led me to nutritional epidemiology and the power of data-driven research
2022
Nutritional Epidemiology Lab
Undergraduate researcher at CliNE Lab (SKKU) under Prof. Jinhee Hur — bridging biotechnology and data analysis
2022–23
Samsung Medical Center
Clinical data researcher · oncology studies (esophageal & breast cancer) · Johns Hopkins-affiliated PIs · real-world patient data
2024
M.S. Data Science — TAMUCC, USA
Full scholarship · GPA 4.0 · expanded into ML, DL, NLP, time-series forecasting
2026
Now 🎯
Thesis → KDD 2026 · seeking Data Analyst / Data Scientist roles in the U.S. · 3-year STEM OPT available

3-year STEM OPT available · open to opportunities across the U.S.

🏅 SKKU President Award
📄 KDD 2026 Submission
🎤 2× Conference Speaker
🔬 Samsung Medical Center
GPA 4.0 Graduate Fellow
Technical Skills

What I work with

Languages & Tools
Python SQL / MySQL PostgreSQL Git / GitHub AWS LaTeX
ML / DL Frameworks
PyTorch TensorFlow Scikit-learn 🤗 Hugging Face 📈 FinBERT ⚡ XGBoost 🔮 TFT ⏱ Chronos-T5 🌐 TimesFM 📉 ARIMA
Data & Visualization
Pandas NumPy 📊 Matplotlib 🎨 Seaborn 🗂 Tableau / Power BI 🔧 ETL Pipelines 📋 KPI Dashboards
Research & Domains
⏳ Time-Series Forecasting 🧠 LLMs / NLP 💬 Sentiment Analysis 🤖 Explainable AI 🔗 Causal Inference 📡 Data Integration ✍️ Technical Writing
Soft Skills
🤝 Cross-functional Collaboration 🎙️ Stakeholder Communication 🧑‍🏫 Research Mentoring 📌 Project Management
Featured Projects

Selected Projects

From Reddit rabbit holes to hospital data — projects that started with a question I couldn't stop thinking about. Some are research, some are practice, all are personal. 🔍

📌 Thesis · KDD 2026 Under Review
Can Reddit Predict the Stock Market?
It started with a late-night scroll through a Korean stock community — someone with a "stock guru" badge posted about quantum computing stocks. I bought in, watched them jump 200%, and thought: is this actually signal? That curiosity became a full research project combining Reddit sentiment + deep learning Transformers.
TFTFinBERTPyTorchNLPReddit APIHugging Face
Financial Forecasting
🔁 Precursor to Thesis · NLP · Time-Series
Reddit-Driven Stock Forecasting
The earlier version that started it all — before the thesis expanded the scope. Focused on TSLA & NVDA with ARIMA, TimesFM, Chronos, and TFT. The results were promising enough (RMSE ↓40.2% for TSLA, ↓87.9% for NVDA) that I kept going: more tickers, richer sentiment features, deeper models. That became the thesis.
TFTChronos-T5VADERTimesFMARIMA
Reddit Stock Forecasting
📊 Tableau · Education · Demographics · 🔄 Ongoing
TAMU Enrollment Trends
Analyzing annual enrollment trends and demographic shifts across the Texas A&M University System — built on real-world IPEDS data (cleaned and processed from scratch). Currently in progress: cleaning raw institutional data, building SQL queries, and constructing the full visualization pipeline.
📂 Data source: NCES IPEDS 2024 ↗
Tableau PublicPower QuerySQLPython · PandasData Cleaning
TAMU Enrollment Dashboard
📊 Tableau · Consumer Brand · Market Analysis
Apple in Korea — A Data Story
Deep-diving into Apple's brand performance and product line popularity in the Korean market — how a global tech brand plays out in one of the world's most brand-conscious consumer markets. Built with simulated sales data to practice real analytical storytelling.
Tableau PublicBrand AnalysisConsumer InsightsKorea Market
Apple in Korea Dashboard
Education

Academic Background

Aug 2024 – May 2026
Texas A&M University–Corpus Christi
Corpus Christi, TX
M.S. Data Science · GPA 4.0 · Full Scholarship
  • Graduate Fellow & Math Teaching Assistant (2 years).
  • Thesis: Financial Time-Series Forecasting with Deep Learning and Social Media Sentiment — submitted to KDD 2026.
  • Focus areas: ML/DL, NLP, Time-Series Forecasting, Statistical Modeling.
Mar 2017 – Aug 2022
Sungkyunkwan University (SKKU)
Seoul & Suwon, Korea
B.S. Culture & Technology + Double Major: Food Science & Biotechnology
  • 🏆 President Award for R&D Innovation — top research recognition at SKKU.
  • #16 in Asia · #87 World (QS/THE 2026) — one of Korea's top 4 universities.
  • Double major bridging digital media/technology and life sciences — the intersection that led me to data.
  • Undergraduate researcher at CliNE Lab (nutritional epidemiology) under Prof. Jinhee Hur.
Relevant Experience

Where I've worked

Sep 2024 – Present
TAMUCC · AI Lab
Corpus Christi, TX
Graduate Research Assistant
  • Thesis research at the intersection of financial time-series modeling and social sentiment analysis.
  • FinBERT + TFT / Chronos-T5 / TimesFM multimodal forecasting framework on Reddit data.
  • Fine-tuning large foundation models; building scalable real-time inference pipelines.
Aug 2022 – Aug 2023
Samsung Medical Center
Seoul, Korea
Clinical Data Research Assistant
  • Epidemiological studies under Prof. Juhee Cho (Johns Hopkins) and Prof. Danbee Kang.
  • Managed & cleaned clinical datasets for oncological and rare disease research.
  • Assisted in data coordination protocols for multi-site epidemiological projects.
Sep 2023 – Jun 2024
Banpo Fineman Academy
Seoul, Korea
Math Instructor
  • Instructed Calculus, Statistics, and Linear Algebra in Seoul's most competitive academic district.
  • Specialized in Mathematical Olympiad (KMO) and Pre-Medical track preparation.
Mar 2022 – Jul 2022
SKKU · CliNE Lab
Suwon, Korea
Research Assistant
  • Literature review and data handling for lab-scale experiments under Prof. Jinhee Hur.
  • Interdisciplinary research bridging biotechnology and data analysis.
Conferences & Activities

Beyond the Lab

Conferences, communities, and moments that shaped who I am as a researcher.

SKKU President Award
🏅 SKKU President Award
1st Place · SKKU Biotech Jamboree

Awarded the President's Prize (총장상) — 1st place at the 11th SKKU Biotech Jamboree. Interdisciplinary research at the intersection of biotechnology and data, presented to university leadership and faculty judges.

View Official Announcement →
9th Coastal Bend Math & Statistics Conference
🎤 Conference Speaker
9th Coastal Bend Mathematics & Statistics Conference

Presented thesis research at Texas A&M International University, Laredo, TX · April 2025. One of the most nerve-wracking and rewarding experiences — standing at a podium and defending your work to a room full of academics.

View on LinkedIn →
SHPE 2025 National Convention
🌐 SHPE Member
SHPE 2025 National Convention

Attended the Society of Hispanic Professional Engineers National Convention — one of the largest STEM career events in the U.S. Career fair, networking, and a reminder that diverse communities make science stronger.

View on LinkedIn →
Certifications

Certifications

🎓 Google Data Analytics Professional Certificate  ·  Coursera  ·  Issued Mar 2026

Process Data from Dirty to Clean
Google · Mar 2026
Verify ↗
Google Data Analytics Capstone
Google · Mar 2026
Verify ↗
Share Data Through the Art of Visualization
Google · Mar 2026
Verify ↗
Prepare Data for Exploration
Google · Mar 2026
Verify ↗
Ask Questions to Make Data-Driven Decisions
Google · Mar 2026
Verify ↗
Foundations: Data, Data, Everywhere
Google · Mar 2026
Verify ↗
Introduction to Data Analysis Using Python
Google · Coursera
Verify ↗
Crash Course on Python
Google · Coursera
Verify ↗
DeepLearning.AI
Retrieval Augmented Generation (RAG)
DeepLearning.AI · Mar 2026
Verify ↗
Activities

A little more about me ✨

I think the best analysts are curious people first — here's what makes me, me.

🏋️
Gym & Movement
Gym

Working out is my reset button. When the code won't cooperate, the gym always will.

🎸
Music Digger
Music

Former bass guitar player in a university rock band. Now I'm always digging — K-pop, jazz, house, R&B, hip-hop, remixes, you name it. Spotify and YouTube Music algorithms are genuinely one of the best things humans have built. Give me a genre and I'll find the deep cuts.

🌏
Language & Travel
Travel

Korean (native), English (fluent), Japanese (learning — Duolingo streak going 🦉). Every language opens a new world — different ways to think, connect, and understand people. I love traveling and meeting people from all over; half the reason I learn languages is just to talk to more humans.

🎨
Aesthetic Nerd
Aesthetic

I love making things look good — fashion, design, dashboards. There's something deeply satisfying about turning something messy into something beautiful. (Yes, this portfolio is also a passion project.)

📚
Perpetual Learner
Studying

Always picking up a new skill — whether it's a new ML framework, a Tableau technique, or whatever rabbit hole I fall into that week. Curiosity is my default setting.

💬
How People Describe Me
Friends

"Curious." "Sweet." "Works harder than anyone." I'd add: chaotic good, aesthetics-obsessed, and permanently excited about data.

Contact

Thanks for visiting!

Open to PhD opportunities, research collaborations, and data science roles. I'd love to hear from you.

or send a message