About me

I am a Machine Learning Engineer at Google developing NLP products and LLMs such as Gemini and PaLM models. I received a Ph.D. in Computer Science from Georgetown University. I was fortunate to work with my advisor, Prof. Lisa Singh. My research focuses on NLP, Language Models, Data Science, and Social Media Mining.

During my PhD time, most of my projects were collaborations with CNN, Washington Post, UMich and MDI on leveraging AI/ML methods for data-centric social concerns such as sentiment analysis, stance detection, force-migration, gun policy, MeToo movement, spam, misinformation and fake news. During summers, I spent some time working at Twitter and Google.

News
Interests
  • Large Language Models (LLMs)
  • NLP, AI, ML, Data Science
  • Efficient ML
  • Misinformation / Fake News
Education
  • PhD in Computer Science, 2017 - 2022

    Georgetown University, USA

  • BEng in Computer Engineering, 2012 - 2016

    Chulalongkorn University, Thailand

Skills

Python
Machine Learning
Data Science

Experience

 
 
 
 
 
Google Research
Software Engineer (AI/ML)
Dec 2022 – Present Mountain View, California
  • Research in multimodal LLM safety and alignment.
  • Develop LLM and NLP models such as Google Gemini, PaLM API, Sentiment Analysis, Entity Extraction, Part-of-Speech Tagging, and AutoML.
  • Research, experiment and implement new features for Google AI products used by millions of people around the world.
 
 
 
 
 
Twitter
Machine Learning Intern
Jun 2022 – Aug 2022 San Francisco, California (Remote)
  • Leverage AI/ML to improve Twitter.
  • Conduct and deploy end-to-end machine learning pipelines from research to production.
 
 
 
 
 
Google
Software Engineer Intern (AI/ML)
Jun 2021 – Aug 2021 Sunnyvale, California (Remote)
 
 
 
 
 
Massive Data Institute
Researcher
May 2020 – Dec 2022 Washington, DC
  • Research in data science, NLP and social media mining focusing on misinformation and fake news in social media.
  • Collaborate with researchers from CNN and University of Michigan to conduct and weekly report our analysis about the US election at The Breakthrough.
 
 
 
 
 
Department of Computer Science, Georgetown University
Teaching Assistant
Dec 2018 – Dec 2022 Washington, DC
  • COSC-282 Big Data Analytics (Undergraduate Level - Spring 2018)
  • COSC-287 Introduction to Data Science (Undergraduate Level - Fall 2019)
  • COSC-587 Introduction to Data Analytics (Graduate Level - Fall 2021)
 
 
 
 
 
Department of Computer Science, Georgetown University
Research Assistant
Aug 2018 – Dec 2022 Washington, DC
  • Advise student research groups working on social media mining projects.
  • Conduct research funded by Massive Data Institute (MDI) and National Science Foundation (NSF).
 
 
 
 
 
LINE Corporation (Tourkrub.co & DGM59)
Software Engineer
Feb 2017 – Jul 2017 Bangkok, Thailand
  • Gather requirements, design, development, testing and validation using Ruby on Rails.
  • Develop APIs to reduce back-office operation time by 75% including PDF bill generation, email confirmation, bank account notification for Slack, etc.
 
 
 
 
 
The Institute of Scientific and Industrial Research (ISIR), Osaka University
Research Intern
Jun 2015 – Aug 2015 Osaka, Japan
  • Research in Neuroscience and Machine Learning mainly using MATLAB and C++. The advisor is Prof. Masayuki Numao.
  • Collaboratively conduct experiments with the Biochemical Lab (Nagai Lab).
  • Develop APIs to collect streaming data from EEG brainwave headset in C++ and apply ML models to evaluate user’s emotions.
 
 
 
 
 
Department of Computer Engineering, Chulalongkorn University
Research Assistant
Jan 2015 – Dec 2016 Bangkok, Thailand
  • A member of Machine Intelligence and Knowledge Discovery Lab (MIND Lab).
  • Conducting research to solve data science problems in real-world including wind power prediction from power plants (time series), emotion prediction from brain wave (neuroscience and ML/AI) and analysis of course materials (text mining).

Publications

(2024). Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. In arXiv.

PDF Cite Technical Report Google Announcement

(2023). Forecasting Ukrainian Refugee Flows With Organic Data Sources. In IMR.

Cite Journal Article

(2023). Identifying High Quality Training Data for Misinformation Detection. In DATA.

Cite Conference Paper

(2023). Investigating Scientific Misinformation Using Different Modes of Learning. In SDU@AAAI.

PDF Cite Workshop Paper

(2022). Detecting and Understanding of Information Pollution on Social Media. In ProQuest Dissertations and Theses.

Cite PhD Dissertation

Related Courses

Graduate-level

  • Algorithms
  • Machine Learning
  • Neural Networks and Deep Learning
  • Text Mining & Analysis
  • Intro to Data Analytics
  • Massive Data Fundamentals
  • Web Search and Sense Making (Scala)
  • Dialogue Systems
  • Algorithms for Distributed Machine Learning (Ph.D. Seminar)
  • Distributed Algorithms and Systems (Ph.D. Seminar)
  • Data Protection by Design (Ph.D. Seminar)
  • Deep Reinforcement Learning (Ph.D. Sit-in)

Undergraduate-level

  • Algorithm Design
  • Artificial Intelligence
  • Software Engineering
  • Auto Speech Recognition
  • Time Series Mining
  • Web Development
  • Statistics for Physical Science
  • Data Structure (C++)
  • Programming Method (JAVA)
  • Distributed System Essential
  • System Analysis and Design
  • Computer Security
  • Database Design
  • Computer Network
  • Computer Graphic

Contact

  • kornraphop.k [at] gee-mail