Blog

  • DoLa – Decoding by Contrasting Layers Improves Factuality in Large Language Models

    Have you ever talked to an LLM and gotten an answer that sounds like it’s been drinking vending-machine coffee all night long? 😂 That’s what we call a “hallucination” in the LLM world! But don’t worry, it’s not that your language model is crazy (although it can sometimes seem that way 🤪). The…


  • QLoRA: Efficient Finetuning of Quantized LLMs

    Hello everyone! 🤗 Today we are going to talk about QLoRA, the technique that lets you fine-tune your language models more efficiently ⏱️. But how does it do it? 🤔 Well, first it uses quantization to reduce the size of the model weights, which saves memory 📈. Then, it…


  • GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

    Attention developers! 🚨 Do you have a language model that is too big and heavy for your application? 🤯 Don’t worry, GPTQ is here to help you! 🤖 This quantization algorithm is like a wizard that makes unnecessary bits and bytes disappear, reducing the size of your model without losing too much precision. 🎩 It’s…


  • llm.int8() – 8-bit Matrix Multiplication for Transformers at Scale

    Get ready to save space and speed up your models! 💥 In this post, I’m going to explore the llm.int8() method, a quantization technique that allows you to reduce the size of your machine learning models without sacrificing too much accuracy. 📊 That means you’ll be able to train and deploy larger and more complex…


  • LLMs quantization

    Imagine having a giant language model that can answer any question, from the capital of France to the perfect brownie recipe! 🏞️🇫🇷 But what happens when that model has to fit on a mobile device? 📱 That’s where quantization comes in! 🎉 This technique allows us to reduce the size of models without sacrificing their…


  • LoRA – low rank adaptation of large language models

    Get ready to take your model adaptation to the next level with LoRA! 🚀 This low-rank adaptation technique is like a superhero cape for your neural networks – it helps them learn new tricks without forgetting old ones 🤯. And the best thing about it? You can implement it in just a few lines of…


  • Fine tuning Florence-2

    You’ve already got Florence-2 on your radar, but now you want to take it to the next level! 🚀 It’s time for fine-tuning! 💻 In this post, I’ll walk you step-by-step through the process of adapting this multimodal model to your specific needs. 📊 From preparing your data to setting up hyperparameters, I’ll show you…


  • Fine tuning SLMs

    😂 “Tuning small language models? Please, you’re not ‘tuning’, you’re just desperately trying to get some sense out of a model that’s about as useful as a chocolate teapot 🍫🚽. But hey, who doesn’t love a good challenge? Go ahead, waste your GPU hours, and maybe, just maybe, you’ll get a model that can distinguish…


  • GPT-2 – Language Models are Unsupervised Multitask Learners

    Unlock the power of text generation with GPT-2, the latest open model from OpenAI 💸! 🚀 In this post, I take you by the hand through the architecture behind this model, and show you how to fine-tune it 😜, code included. Read more and find out how GPT-2 can make your words more interesting than…


  • Florence-2

    Attention everyone! 🚨 We have a new king in town! 👑 Florence-2, the multimodal model that is revolutionizing the artificial intelligence game. 🤯 With only 200M parameters in its base version (or 700M in its large version, for those who want it all 💥), this model is sweeping the benchmarks, beating models with 10 and…


  • BPE tokenizer

    🔍 Discover the secret of tokenization! 🔑 I reveal the mysteries of BPE (Byte Pair Encoding) tokenization, one of the most popular and effective methods to split text into tokens. Learn how to tokenize with BPE! 💻 Read my post and discover the tips and tricks to master tokenization with BPE! 📄


  • GPT1 – Improving Language Understanding by Generative Pre-Training

    Unlock the power of language! 💥 In my latest post, I take you by the hand through the GPT-1 paper, explaining in a clear and concise way how this pioneering model in natural language processing works. And not only that! I also show you how to fine-tune the model so you can adapt it to…


  • Hugging Face Optimum

    Attention, slow PyTorch models! 🐌 Optimum, the Hugging Face library, comes to the rescue to speed up your training and inference. With Optimum, you can forget about speed issues and enjoy more efficiency 🕒️. And best of all, it’s PyTorch compatible – go on, give your models a boost with Optimum! 💻


  • Ollama

    🏃 Run any LLM on your computer, whether it is a NASA spacecraft 🚀 or a coffee maker 🫖. With Ollama you can use any language model as easily as if you were using Docker 🐳. Moreover, as the models are quantized 📥, they will fit in memory 🧠 and they will go very…


  • Hugging Face Accelerate

    🚀 Parallelizing AI models has never been easier! 🤩 With Accelerate, you can say goodbye to endless training times ⏰ and hello to efficiency 💻. With just a few lines of code, you can automatically distribute the training of your models across multiple GPUs or CPUs, which can drastically reduce training times ⏱️, improve efficiency…


  • Hugging Face Evaluate

    Forget sleepless nights calculating metrics and losing your mind evaluating your NLP models! 😩 Hugging Face’s evaluate library is the answer to your prayers, allowing you to evaluate the performance of your models easily and quickly 🚀. With evaluate, you can say goodbye to manual calculations and hello to full automation 🤗 leaving…


  • Hugging Face Datasets

    Wake up, AI developer! ☕️ Stop wasting your time loading and preprocessing data like a slave 🙅‍♂️ and discover Hugging Face’s datasets library, the solution to all your NLP problems (and some of your life problems, too). With datasets, you can access a collection of high-quality datasets that will make you forget you ever had…


  • Hugging Face Transformers

    🤖 Transform your world with Hugging Face Transformers! 🚀 Ready to make magic with natural language? From super-fast techniques with pipeline 🌪️ to ninja tricks with AutoModel 🥷, this post takes you by the hand on an epic adventure into the NLP universe. Explore how to generate text that surprises, train models that dazzle, and…


  • git-sim

    If you are afraid 😱 to use git commands, come in and learn how to test them easily and graphically with git-sim 😊


  • Hugging Face Tokenizers

    📄 ➡️ 🔤 Explore the power of the Hugging Face Tokenizers library for natural language processing in AI. Discover how this essential tool transforms text into structured data, optimizing AI model training with practical examples and Python code. Dive into the future of NLP with our expert guide.


  • Transformers – from top to bottom

    Discover transformers 🚀. Learn the architecture behind all the new language models. Don’t ask an AI, come in and learn.


  • Vector database

    Discover how ChromaDB redefines vector data search and management for AI applications, with ultra-fast and accurate similarity searches in high-dimensional spaces.


  • Regular expressions

    This post explores the power and precision of regular expressions, essential tools in programming that describe patterns in text strings for efficient search, extraction and manipulation. It covers everything from the fundamentals to advanced regex techniques, offering a practical guide with clear examples to master their use in data analysis and management. Get ready to…


  • CSS

    CSS, or Cascading Style Sheets, is a fundamental technology in web design that allows developers and designers to style and present HTML documents in a sophisticated and efficient manner. Through CSS, we can control layout, colors, fonts, and much more, enabling the creation of rich and visually appealing user experiences. This post explores the basics…


  • Mixtral-8x7B

    In this post, we explore Mixtral 8x7B, a state-of-the-art language model developed by Mistral AI. This model, a sparse mixture of experts (SMoE) with open weights, stands out for its ability to outperform models such as Llama 2 70B and GPT-3.5 on most benchmarks, despite having only 46.7B parameters. Mixtral uses a unique architecture where…


  • Measure of similarity between embeddings

    This post explores in depth the key methods of measuring similarity between embeddings: cosine similarity, L2 distance, and dot product. It explains each technique, offering an expert perspective on when and how to use them. Learn how to choose the right metrics for your embedding models and improve your recommender systems, natural language processing…
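
    As a rough illustration of the three measures named in this teaser, here is a minimal sketch in plain Python. The helper names and the toy vectors `u`, `v` and `w` are made up for this example and are not taken from the post itself.

```python
import math


def dot(a, b):
    """Dot (scalar) product: sum of element-wise products."""
    return sum(x * y for x, y in zip(a, b))


def l2_distance(a, b):
    """Euclidean (L2) distance: straight-line distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (ignores their magnitudes)."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


# Hypothetical toy "embeddings", chosen only to make the behaviour visible
u = [1.0, 2.0, 3.0]
v = [2.0, 4.0, 6.0]   # same direction as u, twice the length
w = [-3.0, 0.0, 1.0]  # orthogonal to u

print(cosine_similarity(u, v))  # close to 1: same direction
print(cosine_similarity(u, w))  # close to 0: orthogonal
print(l2_distance(u, v))        # non-zero: the magnitudes differ
print(dot(u, v))                # grows with both alignment and magnitude
```

    Note how `u` and `v` are as similar as possible under cosine similarity yet still separated under L2 distance, which is one reason the choice of metric matters.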


  • Embeddings

    In this post we explore the world of embeddings in artificial intelligence, an essential technique that transforms complex data, such as text, into numerical formats understandable to machine learning models. Discover how embeddings capture meanings and relationships in data, facilitating language processing, and understand their impact on the advancement of LLM technology.


  • Tokens

    You keep hearing that one model has a context window of x tokens? And that another has y tokens? But you have no idea what tokens are? Come in and make yourself comfortable and I’ll tell you what tokens are.


  • HTML

    Index: Comments, Head, Body, Container tags, Header, Navigation, Links, Main, Footer, Section, Article, Divider, Unordered lists (ul), Ordered lists (ol), Content tags, Multimedia tags, Image optimization, img tag, figure tag, video tag, audio tag, iframe tag, dialog tag, Forms, Grouping of elements, Label, Select, Send, Details. HTML Comments. This notebook has been…


  • OpenAI API

    🚀 “Discover the power of the OpenAI API in this post! 🌟 Learn how to install the OpenAI library ✨ and I’ll guide you through the first steps to become an artificial intelligence guru. 🤖 No matter if you’re a curious beginner or a coding expert looking for new adventures, this post has everything you…


  • Docker

    🐳 Create as many development environments as you want with Docker. No more “On my machine it works”.


  • Fail2ban

    🛡️ 💻 Raise your security game! Dive into this tutorial and find out how Fail2Ban helps you stop brute force attacks in their tracks. From installation to configuration, I’ve got you covered. #SecurityToTheMaximum


  • Caffeine

    🚀 Wake up, Ubuntu! ☕ With Caffeine for Linux, say goodbye to those annoying sleep modes that kick in just when you least expect them. Work marathon? Eternal download? Caffeine is here to keep your machine up and running. This cool thing integrates seamlessly with your Ubuntu environment and is a breeze to use. Find out more about…


  • Conda

    Conda vs Pip – the eternal battle! Find out why Conda wears the champion belt 🥊 and how Mamba is not just a dance, but the turbo your data science projects were waiting for 🚀. Dive with us into this maze of Python packages and find your way out in style 😎 🐍. #PackagingAdo


  • Git

    Have you overwritten the code that took you hours to write? Don’t panic! With Git, say goodbye to “Oops, I deleted it”. Join me on this adventure and learn how to play with versions like a programming rockstar. 🎸 💾 #GITandRoll


  • Whisper

    👂 Can’t make out what someone is saying? With Whisper you can not only transcribe it, but also translate it if you want. Come in and learn how.


  • Bash scripting

    Feel like an old school programmer 🐧 with this post about bash scripting.


  • SSH

    Connecting securely is essential in today’s digital world. Discover how SSH allows you to establish encrypted communications between your machine and remote servers. From basic concepts to advanced tips, dive into this comprehensive tutorial and master the craft of SSH. 🔐 💼 #SSHSecure.


  • Blip2

    🤯 Can you imagine adding images to a GPT-style chat and asking questions about them? With Blip2 you can. Come in and learn how.


  • Terminal

    🟢 Feel like a real hacker 🖥 knowing everything about the terminal 💻. Come in and start handling ⌨ the terminal like a pro.


  • Data management with Pandas

    📊 Manage data 📁 easily with Pandas 📖.


  • Matrix calculation with Numpy

    🟢 Learn how to do all the matrix calculations you want, quickly, with NumPy


  • Introduction to Python

    Python 🐍 is one of the MOST USED 💻 programming languages. Come in and learn everything you need to know about Python 🐍.