Actualités
Actualités
A Coding Implementation to Design a Stateful Tutor Agent with Long-Term Memory, Semantic Recall, and Adaptive Practice Generation
In this tutorial, we build a fully stateful personal tutor agent that moves beyond short-lived...
A Coding Implementation to Build and Train Advanced Architectures with Residual Connections, Self-Attention, and Adaptive Optimization Using JAX, Flax, and Optax
In this tutorial, we explore how to build and train an advanced neural network using...
A Coding Implementation to Build an Uncertainty-Aware LLM System with Confidence Estimation, Self-Evaluation, and Automatic Web Research
In this tutorial, we build an uncertainty-aware large language model system that not only generates...
A Coding Implementation to Build a Unified Tool Orchestration Framework from Documentation to Automated Pipelines
In this tutorial, we build a compact, efficient framework that demonstrates how to convert tool...
A Coding Implementation to Build a Transformer-Based Regression Language Model to Predict Continuous Values from Text
We will build a Regression Language Model (RLM), a model that predicts continuous numerical values...
A Coding Implementation to Build a Hierarchical Planner AI Agent Using Open-Source LLMs with Tool Execution and Structured Multi-Agent Reasoning
In this tutorial, we build a hierarchical planner agent using an open-source instruct model. We...
A Coding Implementation to Build a Complete Self-Hosted LLM Workflow with Ollama, REST API, and Gradio Chat Interface
In this tutorial, we implement a fully functional Ollama environment inside Google Colab to replicate...
A Coding Implementation on Qwen 3.6-35B-A3B Covering Multimodal Inference, Thinking Control, Tool Calling, MoE Routing, RAG, and Session Persistence
In this tutorial, we build an end-to-end implementation around Qwen 3.6-35B-A3B and explore how a...
A Coding Implementation on Microsoft’s OpenMementos with Trace Structure Analysis, Context Compression, and Fine-Tuning Data Preparation
In this tutorial, we work with Microsoft’s OpenMementos dataset and explore how reasoning traces are...
A Coding Implementation on kvcached for Elastic KV Cache Memory, Bursty LLM Serving, and Multi-Model GPU Sharing
In this tutorial, we explore kvcached, a dynamic KV-cache implementation on top of vLLM, to...
A Coding Implementation on Deepgram Python SDK for Transcription, Text-to-Speech, Async Audio Processing, and Text Intelligence
In this tutorial, we build an advanced hands-on workflow with the Deepgram Python SDK and...
A Coding Implementation on Building Self-Organizing Zettelkasten Knowledge Graphs and Sleep-Consolidation Mechanisms
In this tutorial, we dive into the cutting edge of Agentic AI by building a...