Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
View zhiyichin's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Highlights

  • Pro

Block or report zhiyichin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zhiyichin/README.md

Hi there 👋 This is Zhi-Yi's GitHub Profile

🍀 I'm a researcher focused on AI safety, interpretability, and trustworthy machine learning.

👀 Currently, I'm a visiting research fellow at the University of Oxford working with Fazl Barez on scalable interpretability methods for LLM capability analysis and safety benchmarking. I'm also a research assistant at the @NYCU-RL-Bandits-Lab at National Yang Ming Chiao Tung University working with Ping-Chun Hsieh on RL backdoor attack detection and post-hoc interpretation of text-to-image model misbehavior, collaborating closely with Pin-Yu Chen from IBM Research. I'll be starting my PhD at CISPA Helmholtz Center for Information Security soon, where I'll work with Mario Fritz on trustworthy AI systems.

🔬 Research Interests

  • AI safety & red-teaming
  • Trustworthy text-to-image generation
  • Reinforcement learning security
  • Interpretability & mechanistic understanding

📫 Get in Touch

📄 CV / 🐦 Twitter / 🐱 GitHub / 🎓 Google Scholar / 💼 LinkedIn / 📷 Instagram / 🧵 Threads / 📘 Facebook

In my free time, I enjoy 🏃running, 📚reading, and exploring 🧁dessert and ☕️coffee shops.

I would like to connect if you have similar interests in all the things I've mentioned above (AI Safety research, running, reading, dessert, coffee). Please feel free reaching out to me at zchin31415[AT]gmail.com

You are the 👇 visitor who visits my profile 😆

Pinned Loading

  1. P4D P4D Public

    [ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementation)

    Python 51 1

  2. DNN-accelerator-on-zynq DNN-accelerator-on-zynq Public

    Digital Design Lab Spring 2019 Final Project

    Verilog 13 1

  3. yolov5-svhn-detection yolov5-svhn-detection Public

    Pytorch implementation of homework 2 for VRDL course in 2021 Fall semester at NYCU.

    Python 10

  4. 3D_Augmentation 3D_Augmentation Public

    3D point cloud data augmentation

    Jupyter Notebook 6 2

  5. personal-website-template personal-website-template Public template

    My personal website as template

    HTML 5

  6. MPO_Reimplementation MPO_Reimplementation Public

    Reimplementation of Maximum a Posteriori Policy Optimisation

    Python 3 2

Morty Proxy This is a proxified and sanitized view of the page, visit original site.