Literature and Resources
Good books and resources to read
This section provides a curated list of books and resources to enhance your understanding of Large Language Models. Each recommendation includes a brief description to help you choose the most suitable resources for you.
Large Language Models
Hosted LLMs
These are powerful LLMs hosted by companies, which you can access through APIs (Application Programming Interfaces). You typically pay for usage.
- OpenAI (ChatGPT): The creators of ChatGPT and GPT-4, offering a range of models.
- Mistral: A European-based company offering competitive models.
- Google (Gemini): Google’s LLM, offering strong performance and integration with Google services.
- Anthropic (Claude): Known for its its ability to handle code effectively.
Local LLMs
These tools can be used to run open-source LLMs that you can download and run on your own machine. This gives you more privacy and control, but requires more technical expertise and computational resources.
- Ollama: Free and open-source tool to run large language models locally, supports a wide range of models. Note, that the models are not as powerful as the hosted ones and that your computer needs to have a good GPU to run larger models. Smaller models with less than 8B parameters can also often be run on a CPU with enough available RAM. Great for privacy and if you don’t want to pay for the hosted models.
- Hugging Face: Hosts a wide range of large language models, including models fine-tuned for specific tasks by the community. Models can also be downloaded to Ollama and run locally, if your computer is powerful enough.
Working with data
In addition to the hosted and local LLMs, there are also tools that allow you to work with LLMs in a browser to build RAG apps or custom chatbots.
- NotebookLM**: Google’s Gemini that can be fed with files, images and YouTube videos to generate text based on the content. Only works within a workspace of Google, you can’t make it available to the public (yet).
- LM Studio: An application for discovering, downloading, and running LLMs locally. It supports various model architectures and offers both a Chat UI and an OpenAI-compatible local server. Features include offline document chat capabilities and easy model downloads from Hugging Face.
- Open Web UI: Open Web UI is a tool to run large language models locally (in conjuction with, for example, Ollama). It is a browser-based interface that allows you to interact with the models and build RAG apps.
Further resources
- Quarto: A static website generator that is very powerful and flexible. Used to create the slides and the website for the course.
- Github: The largest provider for git repositories owned by Microsoft. A lot of open source projects are hosted here and you can read the code.
- Daily Dose of Data Science: A website and a newsletter with lots of easy-to-digest resources to improve your skills in Data Science and Large Language Models.
- Alpha Signal: A newsletter with lots of resources on the current developments of Large Language Models.