katsukii/README.md

Hi, I'm Yusuke

Software engineer with 12 years of experience in web development (Rails / React / AWS), currently expanding into machine learning and deep learning.

Featured Project

An empirical study of neural scaling laws applied to SVG code generation. Trained GPT-style models (1.3M to 88M parameters) on 107M tokens of SVG data and fit power-law curves to characterize how loss scales with model size. Compared Standard Parameterization (SP) with Maximal Update Parameterization (muP) for zero-shot learning-rate transfer across model sizes.
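As a rough illustration of the power-law fitting step, here is a minimal sketch that assumes the simplest form, loss = a * N^(-alpha), with no irreducible-loss term. The function name `fit_power_law` and the data below are hypothetical: the losses are synthetic values generated from known constants, not results from the project, and the project's actual fitting code may differ.

```python
import numpy as np


def fit_power_law(param_counts, losses):
    """Fit loss = a * N**(-alpha) by least squares in log-log space.

    Taking logs gives log(loss) = log(a) - alpha * log(N), which is a
    straight line, so a degree-1 polynomial fit recovers both constants.
    """
    log_n = np.log(np.asarray(param_counts, dtype=float))
    log_loss = np.log(np.asarray(losses, dtype=float))
    slope, intercept = np.polyfit(log_n, log_loss, 1)
    return float(np.exp(intercept)), float(-slope)


# Synthetic, noise-free data generated from a = 10.0 and alpha = 0.1.
# These are illustrative numbers only, not measurements from the study.
sizes = np.array([1.3e6, 5e6, 20e6, 88e6])
synthetic_losses = 10.0 * sizes ** -0.1

a, alpha = fit_power_law(sizes, synthetic_losses)
print(f"a = {a:.3f}, alpha = {alpha:.3f}")
```

Fitting in log-log space keeps the example short; a form with an irreducible-loss offset would need a nonlinear fit instead.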

Figures: SP vs muP scaling curves; generated SVG samples from the 88M-param model

Tech: Python, PyTorch, muP, BPE tokenization, power-law fitting

Project Page | Full Report (PDF) | Repository

Pinned

  1. svg-scaling-project (Public)

    Neural Scaling Laws for SVG Generation

    Jupyter Notebook