Starting a Mistral Megathread to aggregate resources.

This is my new favorite 7B model. It is really good for what it is. I am excited to see what we can tune together. I will be using this thread as a living document, so expect a lot of changes, notes, revisions, and updates.

Let me know if there’s something in particular you want to see here. I will be adding to this thread throughout my fine-tuning journey with Mistral.

Mistral Model Megathread


Key

  • Link #1 - Base Model
  • Link #2 - Instruct Model

Quantized Base Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Samantha Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Kimiko Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Dolphin Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Orca Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Airoboros Models from TheBloke

GPTQ

GGUF

AWQ


If you run any of the quantized/optimized models from TheBloke, please visit the full model page linked from each quantized model card to see and support the developers of each fine-tuned model.

  • Anony Moose
    9 months ago

    That’s fair, I think chat/roleplay are great use cases.

    I also think some of these lightweight models might make for interesting personal recommendation or categorization engines. In my experiments with using models to categorize credit card transaction statements à la Mint, only GPT-4 was able to do a good job out of the box. I bet a small model could do quite well with fine-tuning, though.

    Another thought I had was to make some sort of personal recommendation engine, so you could export your Netflix/Spotify likes and have it recommend movies or music that you might enjoy. I suppose it’s still early days for those kinds of uses for open source models!
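
For anyone who wants to try the transaction categorization idea mentioned above, here is a minimal sketch of the prompt-building and reply-parsing side. The category list and the function names `build_prompt` and `parse_category` are made up for illustration and do not come from the thread. The model call itself is left out on purpose, since the right way to run the model depends on which quantized build and runtime you pick.

```python
# Hypothetical category list; the thread does not specify one.
CATEGORIES = ["groceries", "dining", "transport", "utilities", "entertainment", "other"]


def build_prompt(description: str, categories: list[str] = CATEGORIES) -> str:
    """Build a single-turn classification prompt for one statement line."""
    options = ", ".join(categories)
    return (
        "Classify the credit card transaction into exactly one category.\n"
        f"Categories: {options}\n"
        f"Transaction: {description}\n"
        "Category:"
    )


def parse_category(reply: str, categories: list[str] = CATEGORIES) -> str:
    """Map a free-form model reply onto a known category, falling back to 'other'."""
    text = reply.strip().lower()
    for category in categories:
        # Substring match, so replies like "I think this is Groceries." still resolve.
        if category in text:
            return category
    return "other"
```

You would pass the string from `build_prompt` to whatever model you are running and feed its reply to `parse_category`. Keeping the parser tolerant of extra words matters more with small models, which tend to answer in full sentences rather than a bare label.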