ML-Mond

Transform your documentation into high-quality training datasets for fine-tuning LLMs. Upload docs, configure settings, and download structured JSONL files in minutes.

Everything you need to create training datasets

Stop manually creating training data. DataForge uses AI to generate structured datasets from your documentation, saving you hours of tedious work.
  • Simple file upload
    Drag and drop your documentation files (.md, .txt, .pdf). No signup required. Start generating datasets immediately.
  • AI-powered generation
    Leverages Claude API to intelligently extract and generate Q&A pairs, code examples, and explanations from your docs.
  • Flexible configuration
    Choose dataset type, number of examples, difficulty level, and custom templates. Full control over your training data output.
  • Preview and edit
    Review AI-generated examples before download. Edit questions, answers, and metadata directly in the browser for quality control.
  • Export in multiple formats
    Download as JSONL, CSV, or Parquet. Ready for immediate use with OpenAI, Anthropic, or any fine-tuning pipeline.
  • Real-time progress tracking
    Watch your dataset being generated with live progress updates and status messages. No waiting in the dark.

Ready to create your first dataset?

Join developers who are fine-tuning LLMs faster. Upload your documentation and get structured training data in minutes, not hours.