#open-source + #llm
Public notes from activescott tagged with both #open-source and #llm
Tuesday, April 28, 2026
Thursday, April 23, 2026
Introducing OpenAI Privacy Filter | OpenAI
Today we’re releasing OpenAI Privacy Filter, an open-weight model for detecting and redacting personally identifiable information (PII) in text.
It is designed for high-throughput privacy workflows, and is able to perform context-aware detection of PII in unstructured text. It can run locally, which means that PII can be masked or redacted without leaving your machine. It processes long inputs efficiently, making redaction decisions in a quick, single pass.
Sunday, February 22, 2026
FunctionGemma model overview | Google AI for Developers
FunctionGemma is a specialized version of our Gemma 3 270M model tuned for function calling. It is designed as a strong base for further training into custom, fast, private, local agents that translate natural language into executable API actions.
Sunday, January 25, 2026
Portkey-AI/gateway: A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
The AI Gateway is designed for fast, reliable & secure routing to 1600+ language, vision, audio, and image models. It is a lightweight, open-source, and enterprise-ready solution that allows you to integrate with any language model in under 2 minutes.
Blazing fast (<1ms latency) with a tiny footprint (122kb) Battle tested, with over 10B tokens processed everyday Enterprise-ready with enhanced security, scale, and custom deploymentsWhat can you do with the AI Gateway?
Integrate with any LLM in under 2 minutes - Quickstart Prevent downtimes through automatic retries and fallbacks Scale AI apps with load balancing and conditional routing Protect your AI deployments with guardrails Go beyond text with multi-modal capabilities Explore agentic workflow integrations Manage MCP servers with enterprise auth & observability using MCP Gateway
Home - Phoenix
Arize Phoenix: Open-source LLM tracing and evaluation Evaluate, experiment, and optimize AI products in real time.
Wednesday, January 14, 2026
My answers to the questions I posed about porting open source code with LLMs
the short version is that it’s now possible to point a coding agent at some other open source project and effectively tell it “port this to language X and make sure the tests still pass” and have it do exactly that.
the short version is that it’s now possible to point a coding agent at some other open source project and effectively tell it “port this to language X and make sure the tests still pass” and have it do exactly that.
Does this library represent a legal violation of copyright of either the Rust library or the Python one? #
I decided that the right thing to do here was to keep the open source license and copyright statement from the Python library author and treat what I had built as a derivative work, which is the entire point of open source.
Even if this is legal, is it ethical to build a library in this way? #
After sitting on this for a while I’ve come down on yes, provided full credit is given and the license is carefully considered. Open source allows and encourages further derivative works! I never got upset at some university student forking one of my projects on GitHub and hacking in a new feature that they used. I don’t think this is materially different, although a port to another language entirely does feel like a slightly different shape.
The much bigger concern for me is the impact of generative AI on demand for open source. The recent Tailwind story is a visible example of this—while Tailwind blamed LLMs for reduced traffic to their documentation resulting in fewer conversions to their paid component library, I’m suspicious that the reduced demand there is because LLMs make building good-enough versions of those components for free easy enough that people do that instead.
Monday, December 22, 2025
1.5 TB of VRAM on Mac Studio - RDMA over Thunderbolt 5 | Jeff Geerling
RDMA lets the Macs all act like they have one giant pool of RAM, which speeds up things like massive AI models.
exo-explore/exo: Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
exo connects all your devices into an AI cluster. Not only does exo enable running models larger than would fit on a single device, but with day-0 support for RDMA over Thunderbolt, makes models run faster as you add more devices.
Thursday, December 18, 2025
skypilot-org/skypilot: Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).
Monday, December 8, 2025
Essential AI
Rnj-1 is an 8B model that roughly follows the open-source Gemma 3 architecture. We employ global self-attention and YaRN to extend the context to 32k. The Rnj-1 Base and Instruct models compare favorably against similarly sized open weight models.
Rnj-1 Instruct dominates the pack on Agentic coding, one of our target abilities. SWE bench performance is indicative of the model's ability to tackle everyday software engineering tasks. We are an order of magnitude stronger than comparably sized models on SWE-bench and approach the capabilities available in much larger models (leaderboard: SWE-bench-Verified bash-only).