- mlsys
- gpu
- serving
- distributed-training
- rl
- engineering
- **NeMo-RL vs Slime — A Systems-Level Comparison**: Deep comparison of two RL training frameworks: feature matrices, algorithmic differences, MoE readiness, and when to use which.
- **Adding Sequence Parallelism to Slime's FSDP Backend**: Design doc for integrating Ring-Attention-based sequence parallelism into slime's FSDP training backend: architecture, tradeoffs, and RL coupling.
- **A Practical Guide to LLM GPU Memory Estimation**: How to estimate GPU memory for LLM training: precision formats, optimizer states, parallelism strategies, and a worked example with Qwen-2.5-7B on H100.
- **SGLang as GRPO Inference Backend in TRL**: Integrating SGLang into HuggingFace TRL for GRPO training: server-based rollout, distributed-init fixes, and memory optimization.