• 0 Posts
  • 31 Comments
Joined 1 year ago
cake
Cake day: June 21st, 2023

help-circle









  • Similar use cases to what I’m doing right now, running LLMs like Mixtral8x7B (or something better by the time we start seeing these), Whisper (STT), or Stable Diffusion.

    I use a fine tuned version of Mixtral (dolphin-Mixtral) for coding purposes.

    Transcribing live audio for notes/search, or translating audio from different languages using Whisper (especially useful for verifying claims of translations for Russian/Ukrainian/Hebrew/Arabic especially with all of the fake information being thrown around).

    Combine the 2 models above with a text to speech system (TTS), a vision model like LLaVA and some animatronics and then I’ll have my own personal GLaDOS: https://github.com/dnhkng/GlaDOS

    And then there’s Stable Diffusion for generating images for DnD recaps, concept art, or even just avatar images.