From my understanding, the mod file like deepseek_r1_sigmoid_top_k_routed_scaled_f32.mod is the kernel operation really carry out calculation like inference deepseek_r1. I wound like to know how is the mod file generated? And where can I find the source code.
Seems like there is only one cpp file in this repository, and it did not contain all the code.
From my understanding, the mod file like deepseek_r1_sigmoid_top_k_routed_scaled_f32.mod is the kernel operation really carry out calculation like inference deepseek_r1. I wound like to know how is the mod file generated? And where can I find the source code.
Seems like there is only one cpp file in this repository, and it did not contain all the code.