Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
View baudhya's full-sized avatar

Block or report baudhya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
baudhya/README.md

Hi 👋, I'm Siddharth Baudh

Scientist at DRDO India | Ex-IISc | Systems & ML Infra Enthusiast

baudhya


🎓 M.Tech in Computer Science, Indian Institute of Science (IISc), Bangalore
🔬 Scientist, Defence Research and Development Organisation (DRDO), India
💼 Former Software Engineer @ Finoit Technologies
📫 Reach me: siddharthbaudhya6216@gmail.com | LinkedIn


🛠️ Interests & Specialties

  • Systems Programming: eBPF, Kernel Internals, OS-Level Instrumentation
  • Compilers & Security: LLVM FunctionPass, syscall tracing, sandboxing
  • Machine Learning Infra: CUDA, AVX512, parallel training, PyTorch, Ring-AllReduce
  • High-Performance Systems: Branch predictors, simulation (ChampSim), memory checkpointing
  • Languages & Tools: C++, Python3, eBPF, CUDA, Docker, Git, FastAPI, Celery

🚀 Highlight Projects

  • 🧩 Memory Checkpointing with eBPF – Checkpoints heap memory at runtime using syscall-level instrumentation
  • 🔐 In-Kernel Sandboxing – Combines LLVM IR instrumentation + eBPF for enforcing syscall-level policies
  • ⚙️ Optimizing Dilated Convolution – CUDA + SIMD (AVX2/AVX-512) version achieving up to 800x speedup
  • 📈 Data-Parallel LeNet – Distributed training using Ring-AllReduce; 50% faster with same accuracy

🧠 Background

I'm passionate about building robust, high-performance systems at the intersection of systems software, machine learning infrastructure, and computer security. I enjoy working close to the OS and hardware, optimizing performance, and developing infrastructure for future AI systems.


📌 Let’s Connect

Open to collaborations, research projects, and deep tech discussions in OS, ML systems, compilers, and secure computing.

📬 Email: siddharthbaudhya6216@gmail.com
🔗 LinkedIn: linkedin.com/in/baudhya

Pinned Loading

  1. dilated-convolution-optimization dilated-convolution-optimization Public

    Optimizing Performance of Dilated Convolution (DC) - Single-threaded , multi-threaded and GPU

    C++

  2. In-Kernel-Per-Process-Sandbox In-Kernel-Per-Process-Sandbox Public

    Linux kernel-level enforcement engine to enforce policy.

    Python

  3. memory-checkpoint-ebpf memory-checkpoint-ebpf Public

    A cutting-edge eBPF-based solution for efficient memory checkpointing and restoration, enabling lightweight process monitoring, low-latency execution, and seamless state recovery—offering a modern …

    Python

  4. PageRank PageRank Public

    Python

  5. spinlocks spinlocks Public

    Spinlocks are synchronization primitives used in concurrent programming to achieve mutual exclusion by having threads repeatedly check (or "spin") for lock availability, which is efficient when th…

    C++

  6. rat-race-game rat-race-game Public

    Python

Morty Proxy This is a proxified and sanitized view of the page, visit original site.