Hi HN, We’ve just published a lot of original, visual, and intuitive explanations of concepts to introduce people to large language models. It’s available for free with no sign-up needed and it includes text articles, some video explanations, and code examples/notebooks as well. And we’re available to answer your questions in a dedicated Discord channel. You can find it here: https://llm.university/ Having written https://jalammar.github.io/illustrated-transformer/, I’ve been thinking about these topics and how best to communicate them for half a decade. But this project is extra special to me because I got to collaborate on it with two of who I think of as some of the best ML educators out there. Luis Serrano of https://www.youtube.com/@SerranoAcademy and Meor Amer, author of “A Visual Introduction to Deep Learning” https://kdimensions.gumroad.com/l/visualdl We’re planning to roll out more content to it (let us know what concepts interest you). But as of now, it has the following structure (With some links for highlighted articles for you to audit): — Module 1: What are Large Language Models – Text Embeddings (https://docs.cohere.com/docs/text-embeddings) – Similarity between words and sentences (https://docs.cohere.com/docs/similarity-between-words-and-sentences) – The attention mechanism – Transformer models (https://docs.cohere.com/docs/transformer-models HN Discussion: https://news.ycombinator.com/item?id=35576918) – Semantic search — Module 2: Text representation – Classification models (https://docs.cohere.com/docs/classification-models) – Classification Evaluation metrics (https://docs.cohere.com/docs/evaluation-metrics) – Classification / Embedding API endpoints – Semantic search – Text clustering – Topic modeling (goes over clustering Ask HN posts https://docs.cohere.com/docs/clustering-hacker-news-posts) – Multilingual semantic search – Multilingual sentiment analysis — Module 3: Text generation – Prompt engineering (https://docs.cohere.com/docs/model-prompting) – Use case ideation – Chaining prompts — A lot of the content originates from common questions we get from users of the LLMs we serve at Cohere. So the focus is more on application of LLMs than theory or training LLMs. Hope you enjoy it, open to all feedback and suggestions!
Story Published at: May 25, 2023 at 01:50PM
Story Published at: May 25, 2023 at 01:50PM