• Pantoffel@feddit.de
    link
    fedilink
    arrow-up
    3
    ·
    11 months ago

    So what are you building? A browser STT interface for chatting with GPT and other LLMs?

    • flossdaily@lemmy.world
      link
      fedilink
      arrow-up
      7
      ·
      edit-2
      11 months ago

      I’m not ready to talk about it in detail. Even my boss doesn’t know. But you’re in the right ballpark.

      I’m actually building a proof-of-concept prototype for what I want to work on… and I’m using a browser extension so that I can build it independently without anyone from the tech team being involved and slowing me down.

      • Pantoffel@feddit.de
        link
        fedilink
        arrow-up
        2
        ·
        11 months ago

        That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.