Home

Topic

machine learning

31 articles across 10 sub-topics

Sub-topic

Ai

16 articles

LLM Hallucinations: Causes, Detection, and Mitigation Strategies

LLM Hallucinations: Causes, Detection, and Mitigation Strategies

TLDR: LLMs hallucinate because they are trained to predict the next plausible token — not the next true token. Understanding the three hallucination types (factual, faithfulness, open-domain) plus the five root causes lets you choose the right mitiga...

28 min read
Reinforcement Learning: Agents, Environments, and Rewards in Practice

Reinforcement Learning: Agents, Environments, and Rewards in Practice

TLDR: Reinforcement Learning trains agents to make sequences of decisions by learning from rewards and penalties. Unlike supervised learning, RL learns through trial and error rather than labeled examples. Use it for sequential decision problems wher...

14 min read

SFT for LLMs: A Practical Guide to Supervised Fine-Tuning

TLDR: Supervised fine-tuning (SFT) is the stage where a pretrained model learns task-specific response behavior from curated input-output examples. It is usually the first alignment step after pretraining and often the foundation for later RLHF. Good...

12 min read

Why Embeddings Matter: Solving Key Issues in Data Representation

TLDR: Embeddings convert words (and images, users, products) into dense numerical vectors in a geometric space where semantic similarity = geometric proximity. "King - Man + Woman ≈ Queen" is not magic — it is the arithmetic property of well-trained ...

13 min read

What are Logits in Machine Learning and Why They Matter

TLDR: Logits are the raw, unnormalized scores produced by the final layer of a neural network — before any probability transformation. Softmax converts them to probabilities. Temperature scales them before Softmax to control output randomness. 📖 T...

11 min read

Unlocking the Power of ML, DL, and LLM Through Real-World Use Cases

TLDR: ML, Deep Learning, and LLMs are not competing technologies — they are a nested hierarchy. LLMs are a type of Deep Learning. Deep Learning is a subset of ML. Choosing the right layer depends on your data type, problem complexity, and available t...

14 min read