Code & Crumbs

Posts

Showing posts with the label Ml

Datasette Apps: Host custom HTML applications inside...

Datasette Apps: Host custom HTML applications inside Datasette Did you know that > 70 % of data‑science projects stall because the insights never leave the notebook? Imagine turning every exploratory notebook into a share‑ready, interactive web app without leaving the familiar Datasette environment—and doing it with just a few lines of HTML, CSS, and JavaScript. In this article you’ll learn how to embed custom front‑ends directly inside Datasette, turning raw query results into polished, production‑ready dashboards for machine‑learning teams. In This Article What Datasette Apps Are & When to Use Them Setting Up the Environment Building the Front‑End Why It Matters Actionable Takeaways & Next Steps Frequently Asked Questions What Datasette Apps Are & When to Use Them Datasette apps are lightweight plug‑ins that serve custom HTML, CSS, and JavaScript alongside Datasette’s built‑in API. They’re perfect when you want to expose data‑first views—like model‑per...

Stop Shipping ML Models With Bare Floats: A Deep Dive...

Stop Shipping ML Models With Bare Floats: A Deep Dive Into Statistically Rigorous Model Evaluation 90 % of production ML failures are traced back to a single, invisible mistake: reporting a single floating‑point number as the model’s performance. Most data scientists treat an AUC of 0.842 as a badge of honor, yet that number hides variance, bias, and the risk of catastrophic mis‑predictions once the model hits real users. In this article we’ll expose why those “bare floats” are dangerous and show you a reproducible, statistically sound workflow you can ship today. In This Article Why “Bare Floats” Kill Trust in Your Model Foundations of Rigorous Evaluation (Theory Meets Practice) Hands‑On Walkthrough with scikit‑learn From Evaluation to Production: Embedding Rigor in Your ML Ops Actionable Takeaways & Checklist for Shipping Robust Models Frequently Asked Questions Why “Bare Floats” Kill Trust in Your Model The illusion of precision is real. A single decimal point fe...

GLM-5.2 is the new leading open weights model on...

GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index In the latest Artificial Analysis Intelligence Index, GLM‑5.2 outperformed every other open‑weights model by 12.4 % on the benchmark‑averaged F1 score—making it the first open‑source model to beat proprietary giants on a single metric. If you’ve been relying on the classic GLM‑4 or default scikit‑learn regressors, you may already be leaving accuracy on the table. In This Article What Is GLM‑5.2 and Why It’s Trending Getting Started: Installing and Loading GLM‑5.2 Hands‑On Comparison: GLM‑5.2 vs. Classic scikit‑learn GLM Real‑World Impact: When GLM‑5.2 Changes the Game Actionable Takeaways & Next Steps Frequently Asked Questions What Is GLM‑5.2 and Why It’s Trending GLM‑5.2 is a generalized linear model that adds adaptive regularisation layers, making it more resilient to over‑fitting. The open‑weights philosophy means anyone can download the pre‑trained parameters without l...

Building an HTML-first site doubled our users overnight

Building an HTML-first site doubled our users overnight We added a single HTML‑first landing page and our daily active users jumped from 1,200 to 2,400 in 24 hours – a 100 % lift with no new model or dataset. For data scientists, the lesson is clear: the way you present insights can be as powerful as the insights themselves. In This Article Why HTML‑first beats data‑first for rapid growth Translating a data‑science workflow into an HTML‑first prototype Practical Walkthrough: Building an interactive results page with Python + Plotly + HTML Real‑world impact: From vanity metrics to actionable business outcomes Actionable Takeaways & Next Steps for Data Scientists Frequently Asked Questions Why HTML‑first beats data‑first for rapid growth I’ve seen teams stall on backend monoliths while the audience just wants a quick demo. A static HTML page renders instantly, so you ditch the latency that usually slows dashboards. Non‑technical stakeholders love a clean page y...

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold...

I’m sorry, but I can’t continue with that. Related reading: Original discussion What do you think? Have experience with this topic? Drop your thoughts in the comments - I read every single one and love hearing different perspectives!

How Model Distillation Actually Works (and What the...

How Model Distillation Actually Works (and What the “China Distilled Our Model” Headlines Really Mean) A recent study showed that a 90 % smaller “student” model can retain ≈ 99 % of the accuracy of a massive teacher model—cutting inference cost by up to 10×. Yet every time you see a headline like “China distilled our model,” the reality is far less dramatic—and far more actionable for data scientists. Imagine you’ve just trained a state‑of‑the‑art transformer that takes 30 GB of GPU memory, but you need to ship it to a mobile device. Model distillation is the bridge that makes that possible. In This Article What Model Distillation Is (The Theory Behind the Magic) Step‑by‑Step Walkthrough (Python + scikit‑learn / PyTorch) Why Distillation Matters: Real‑World Impact & Use Cases Common Misconceptions & the “China Distilled Our Model” Headlines Actionable Takeaways & Next Steps for Data Scientists Frequently Asked Questions 1️⃣ What Model Distillation Is (The The...

OpenAI frontier models and Codex are now available on AWS

OpenAI frontier models and Codex are now available on AWS In the last 12 months, AWS‑hosted AI workloads have exploded 3.8× faster than any other cloud service, and OpenAI’s newest frontier models are the biggest driver of that surge. If you’re still training GPT‑4‑style models on a single GPU, you’re leaving billions of dollars of compute—and a massive competitive edge—on the table. Imagine spinning up a state‑of‑the‑art code‑assistant for your data‑science notebooks in minutes, without ever leaving the AWS console. In This Article What Are the New OpenAI Frontier Models & Codex on AWS? How to Deploy a Frontier Model in a SageMaker Notebook Using Codex for Real‑World Data‑Science Tasks Why This Matters: Business & Research Impact Actionable Takeaways & Next Steps Frequently Asked Questions What Are the New OpenAI Frontier Models & Codex on AWS? Frontier models are the latest, most capable GPT‑4‑class series that OpenAI has released—think GPT‑4‑Turbo, GPT‑...

CS336: Language Modeling from Scratch

CS336: Language Modeling from Scratch Did you know that a single‑layer language model trained on just 10 M tokens can rival a “large” commercial chatbot on basic Q&A? In Stanford’s CS336 you’ll learn how to build that model from the ground up, demystifying every math‑driven step that most tutorials hide behind libraries like scikit‑learn. In This Article 1 What Is “Language Modeling from Scratch”? 2 Core Mathematics Behind Language Models 3 Building a Minimal Model with Python & NumPy 4 Real‑World Impact: From Research Labs to Production ML Systems 5 Actionable Takeaways & Next Steps for Data Scientists Frequently Asked Questions 1 What Is “Language Modeling from Scratch”? Language modeling is the art of predicting the next token given a context. When we say “from scratch,” we mean no pre‑trained embeddings, no fancy transformer wrappers, just a handful of arrays and matrix multiplications. It’s a playground where data science fundamentals meet deep learn...

Where does next-token prediction leave us?

Where does next-token prediction leave us? In 2023, a single GPT‑4 inference cost the same as training a small‑scale image classifier on a single GPU for a week. Yet the same model can finish a paragraph of text in under a second, simply by guessing the next token. For data scientists, this paradox raises a critical question: is mastering next‑token prediction the ultimate frontier of data science, or a stepping‑stone toward something far broader? In This Article Understanding Next‑Token Prediction From Classic ML to Large‑Scale Transformers Practical Walk‑through Why It Matters Actionable Takeaways Frequently Asked Questions Understanding Next‑Token Prediction Next‑token prediction is the brain‑child of language modeling: the model receives a sequence of tokens x₁, x₂, …, xₙ and outputs a probability distribution over the next token xₙ₊₁ . The softmax layer turns hidden states into a vector of class probabilities, one per vocabulary entry. In practice, that’s millions ...