What’s the state of the art in open source GPT models right now, in practical terms? If your typical use case is taking a pretrained model and fine tuning it to a specific task, which LLM would yield the best results while running on consumer hardware? Note that I’m specifically asking for software that I can run on my own hardware, I’m not interested in paying OpenAI $0.02 per API request. I’ll start the recommendations with Karpathy’s nanoGPT: https://github.com/karpathy/nanoGPT What else do we have?
Story Published at: February 14, 2023 at 08:58PM
Story Published at: February 14, 2023 at 08:58PM