#code

Public notes from activescott tagged with #code

All things code!

Monday, December 22, 2025

1.5 TB of VRAM on Mac Studio - RDMA over Thunderbolt 5 | Jeff Geerling

www.jeffgeerling.com/blog/2025/15-tb-vram-on-mac-studio-rdma-over-thunderbolt-5

RDMA lets the Macs all act like they have one giant pool of RAM, which speeds up things like massive AI models.

#6:06 AM

llm code open-source

exo-explore/exo: Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

github.com/exo-explore/exo

exo connects all your devices into an AI cluster. Not only does exo enable running models larger than would fit on a single device, but with day-0 support for RDMA over Thunderbolt, makes models run faster as you add more devices.

#6:04 AM

llm code open-source

Thursday, December 18, 2025

skypilot-org/skypilot: Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

github.com/skypilot-org/skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

#11:11 PM

gpu llm ai code open-source

Wednesday, December 17, 2025

dimdenGD/ultimate-express: The Ultimate Express. Fastest http server with full Express compatibility, based on µWebSockets.

github.com/dimdenGD/ultimate-express

The Ultimate Express. Fastest http server with full Express compatibility, based on µWebSockets.

This library is a very fast re-implementation of Express.js 4. It is designed to be a drop-in replacement for Express.js, with the same API and functionality, while being much faster. It is not a fork of Express.js. To make sure µExpress matches behavior of Express in all cases, we run all tests with Express first, and then with µExpress and compare results to make sure they match.

npm install ultimate-express -> replace express with ultimate-express -> done

#4:01 AM

js code benchmarks tools

Round 23 results - TechEmpower Framework Benchmarks

www.techempower.com/benchmarks/#section=data-r23

#3:59 AM

js code benchmarks tools

uNetworking/uWebSockets.js: μWebSockets for Node.js back-ends :metal:

github.com/uNetworking/uWebSockets.js/

µWebSockets.js is a standards compliant web server written in 10,000 lines of C++. It is exposed to Node.js as a simple-to-use, native V8 addon and performs at least 10x that of Socket.IO, 8.5x that of Fastify. It makes up the core components of Bun and is the fastest standards compliant web server in the TechEmpower (not endorsed) benchmarks.

#3:58 AM

js code benchmarks tools

Web Frameworks Benchmark

web-frameworks-benchmark.netlify.app/result

#3:49 AM

js code benchmarks

Tuesday, December 16, 2025

hiyouga/LLaMA-Factory: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

github.com/hiyouga/LLaMA-Factory

Easily fine-tune 100+ large language models with zero-code CLI and Web UI

#7:09 AM

llm fine-tuning code

6476_SWE_bench_Can_Language_Mo.pdf

openreview.net/pdf?id=VTF8yNQM66

we in- troduce SWE-bench, an evaluation framework consisting of 2,294 software engineering problems drawn from real GitHub issues and corresponding pull requests across 12 popular Python repositories. Given a codebase along with a description of an issue to be resolved, a language model is tasked with editing the codebase to address the issue. Resolving issues in SWE-bench frequently requires under standing and coordinating changes across multiple functions, classes, and even files simultaneously, calling for models to interact with execution environments, process extremely long contexts and perform complex reasoning that goes far beyond traditional code generation tasks.

#7:06 AM

llm fine-tuning code benchmark

Unsloth AI - Open Source Fine-tuning & RL for LLMs

unsloth.ai/

Easy to use, well documented fine-tuning. NVIDIA optimized with AMD support and Apple M support in the works.

#6:52 AM

llm fine-tuning code

Monday, December 15, 2025

Monitoring Kubernetes pod performance metrics | Datadog

www.datadoghq.com/blog/monitoring-kubernetes-performance-metrics/

#6:34 PM

kubenetes observability alerting data-dog code

Sunday, December 14, 2025

Introducing Nested Learning: A new ML paradigm for continual learning

research.google/blog/introducing-nested-learning-a-new-ml-paradigm-for-continual-learning/

continually updating a model's parameters with new data, often leads to “catastrophic forgetting” (CF), where learning new tasks sacrifices proficiency on old tasks. Researchers traditionally combat CF through architectural tweaks or better optimization rules. However, for too long, we have treated the model's architecture (the network structure) and the optimization algorithm (the training rule) as two separate things, which prevents us from achieving a truly unified, efficient learning system.

By defining an update frequency rate, i.e., how often each component's weights are adjusted, we can order these interconnected optimization problems into "levels." This ordered set forms the heart of the Nested Learning paradigm.

We observed that many standard optimizers rely on simple dot-product similarity (a measure of how alike two vectors are by calculating the sum of the products of their corresponding components) whose update doesn't account for how different data samples relate to each other. By changing the underlying objective of the optimizer to a more standard loss metric, such as L2 regression loss (a common loss function in regression tasks that quantifies the error by summing the squares of the differences between predicted and true values), we derive new formulations for core concepts like momentum, making them more resilient to imperfect data.

In a standard Transformer, the sequence model acts as a short-term memory, holding the immediate context, while the feedforward neural networks act as long-term memory, storing pre-training knowledge. The Nested Learning paradigm extends this concept into what we call a “continuum memory system” (CMS), where memory is seen as a spectrum of modules, each updating at a different, specific frequency rate. This creates a much richer and more effective memory system for continual learning.

"Nested Learning" extends the traditional two-tier memory concept of "attention layers" (short-term memory / context window) and "feed-forward network layers" (long term memory) into a spectrum of modules that update at different rates, some very frequently (like attention), some rarely (like FFNs), and others at various points in between.

#6:50 PM

code google ml llm

Friday, December 12, 2025

webinstall.dev

webinstall.dev/

#8:29 PM

cli tools code

whalebrew/whalebrew: Homebrew, but with Docker images

github.com/whalebrew/whalebrew

Whalebrew creates aliases for Docker images so you can run them as if they were native commands. It's like Homebrew, but with Docker images.

#7:45 PM

homebrew tools code docker containers

Monday, December 8, 2025

CEL | Common Expression Language

cel.dev/

Common Expression Language (CEL) is an expression language that’s fast, portable, and safe to execute in performance-critical applications. CEL is designed to be embedded in an application, with application-specific extensions, and is ideal for extending declarative configurations that your applications might already use.

#3:27 AM

code open-source google