Skip to content

feat: MLflow experiment tracking for Crusoe Managed Inference#56

Open
Sakshi3027 wants to merge 1 commit into
crusoecloud:mainfrom
Sakshi3027:feat/mlflow-on-crusoe
Open

feat: MLflow experiment tracking for Crusoe Managed Inference#56
Sakshi3027 wants to merge 1 commit into
crusoecloud:mainfrom
Sakshi3027:feat/mlflow-on-crusoe

Conversation

@Sakshi3027

Copy link
Copy Markdown

What this adds

MLflow integration for tracking LLM experiments on Crusoe Managed Inference.

Logs 3 experiment runs out of the box:

  • summarization
  • reasoning
  • code generation

What gets tracked per run

  • Parameters — model, temperature, max_tokens, prompt
  • Metrics — latency_seconds, output_word_count, words_per_second
  • Artifacts — full model response saved as a text file

Why it's useful

Teams running LLM evaluations on Crusoe need a way to compare runs across prompts, temperatures, and models. This gives them a local MLflow tracking server with zero infrastructure setup just python train_and_log.py then mlflow ui.

Testing

Tested locally using Groq as a drop-in replacement. All 3 runs logged successfully with latency and throughput metrics visible in the MLflow UI.

To run on Crusoe:
export CRUSOE_API_KEY="your-api-key"
python train_and_log.py
mlflow ui

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant