Top stories

pabs3 about 10 hours ago

Motorola GrapheneOS devices will be bootloader unlockable/relockable

The article discusses the potential security and privacy benefits of the GrapheneOS mobile operating system, which is designed to provide enhanced security and control compared to mainstream mobile OS options.

grapheneos.social
733 229
Summary
P_qRs about 4 hours ago

RFC 9849. TLS Encrypted Client Hello

RFC 9849 specifies the Internet Engineering Task Force's (IETF's) processes for managing and maintaining the set of Request for Comments (RFCs), which serve as the authoritative document series for Internet standards and best current practices.

rfc-editor.org
73 19
Summary
r4um about 6 hours ago

Agentic Engineering Patterns

This article explores agentic engineering patterns, which focus on empowering users and fostering agency within software systems. It discusses techniques such as progressive disclosure, user-defined defaults, and UI affordances that enable users to customize and tailor applications to their specific needs.

simonwillison.net
154 51
Summary
Better JIT for Postgres
vladich about 5 hours ago

Better JIT for Postgres

The article discusses the pg_jitter extension for PostgreSQL, which adds jitter to query planning and execution to help with performance and workload randomization. It provides a way to introduce controlled randomness into PostgreSQL's query planning and execution processes, which can be useful for a variety of use cases, such as improving performance and avoiding consistency problems.

github.com
56 13
Summary
A CPU that runs entirely on GPU
cypres about 7 hours ago

A CPU that runs entirely on GPU

The article describes the development of nCPU, an open-source, modular CPU architecture that can be implemented using field-programmable gate arrays (FPGAs). The project aims to provide a flexible and customizable CPU design for educational and research purposes.

github.com
98 39
Summary
The next era of social media: built and run in Europe, ruled by our laws
doener about 1 hour ago

The next era of social media: built and run in Europe, ruled by our laws

eurosky.tech
9 3
MacBook Pro with M5 Pro and M5 Max
scrlk about 21 hours ago

MacBook Pro with M5 Pro and M5 Max

Apple introduces the new MacBook Pro with the powerful M5 Pro and M5 Max chips, providing enhanced performance and efficiency for users.

apple.com
789 830
Summary
TikTok will not introduce end-to-end encryption, saying it makes users less safe
1659447091 about 10 hours ago

TikTok will not introduce end-to-end encryption, saying it makes users less safe

The article discusses the rising costs of living in the UK, with inflation reaching its highest level in 40 years. It explores the impact on household budgets and the measures the government is taking to address the crisis, including a new energy support package.

bbc.com
257 201
Summary
abetusk about 9 hours ago

Graphics Programming Resources

develop--gpvm-website.netlify.app
98 11
Show HN: Rust compiler in PHP emitting x86-64 executables
mrconter11 3 days ago

Show HN: Rust compiler in PHP emitting x86-64 executables

The article discusses the development of a PHP extension for the Rust compiler, allowing Rust code to be executed within PHP applications. This integration aims to leverage Rust's performance and safety benefits to enhance the capabilities of PHP-based web applications.

github.com
40 32
Summary
Show HN: I made a zero-copy coroutine tracer to find my scheduler's lost wakeups
lixiasky 1 day ago

Show HN: I made a zero-copy coroutine tracer to find my scheduler's lost wakeups

coroTracer is an open-source contact tracing tool that utilizes Bluetooth Low Energy (BLE) technology to track potential COVID-19 exposure. The system aims to provide a privacy-preserving solution for tracking and notifying individuals who may have been in close contact with confirmed COVID-19 cases.

github.com
8 0
Summary
jruohonen 3 days ago

On the Design of Programming Languages (1974) [pdf]

web.cs.ucdavis.edu
45 6
fs123 1 day ago

Claude's Cycles [pdf]

www-cs-faculty.stanford.edu
642 268
vnglst 4 days ago

Show HN: Stacked Game of Life

The article describes a web-based implementation of the Game of Life, a cellular automaton that simulates the evolution of a two-dimensional grid of cells. The game allows users to create and observe complex patterns emerging from simple initial configurations.

stacked-game-of-life.koenvangilst.nl
19 7
Summary
Voxile: A ray-traced game made in its own engine and programming language
spacemarine1 about 14 hours ago

Voxile: A ray-traced game made in its own engine and programming language

Voxray Games, an indie game studio, has released a major update to their game, featuring new gameplay mechanics, improved graphics, and expanded content. The article details the key changes and improvements made to the game, highlighting the studio's commitment to enhancing the player experience.

elbowgreasegames.substack.com
199 56
Summary
dmckinno about 11 hours ago

My spicy take on vibe coding for PMs

The article discusses the concept of 'vibe coding' for product managers, which focuses on creating a positive and productive work environment through effective communication and collaboration. The author shares their personal take on the benefits and challenges of incorporating vibe coding into product management practices.

ddmckinnon.com
104 101
Summary
giancarlostoro 3 days ago

Textadept

Textadept is a lightweight and customizable text editor designed for programmers, featuring cross-platform compatibility, rapid scripting capabilities, and a variety of plugins and themes to enhance productivity and workflow.

orbitalquark.github.io
140 23
Summary
Speculative Speculative Decoding (SSD)
E-Reverance about 8 hours ago

Speculative Speculative Decoding (SSD)

The article presents a novel approach to natural language processing using transformer-based models. It explores techniques for improving the performance of these models on various language tasks, including text generation, summarization, and question answering.

arxiv.org
44 7
Summary
Reverse-Engineering the Wetware: Spiking Networks and the End of Matrix Math
pgte 3 days ago

Reverse-Engineering the Wetware: Spiking Networks and the End of Matrix Math

This article explores the potential limitations of neural networks and matrix math in modeling biological neural networks. It proposes an alternative approach focused on reverse-engineering the 'wetware' of the brain, including spiking networks and temporal difference errors, as a way to develop more biologically-inspired AI systems.

metaduck.com
23 8
Summary
You can use newline characters in URLs
chmaynard 3 days ago

You can use newline characters in URLs

The article discusses how newline characters can be used in URLs, which can have unexpected consequences for website owners and users. It explores the potential implications and provides recommendations for handling such URLs.

lemire.me
79 35
Summary
todsacerdoti about 18 hours ago

When AI writes the software, who verifies it?

The article explores the potential impact of AI systems on the software development industry, discussing how AI could automate code writing and revolutionize the software development process, while also raising concerns about the ethical and societal implications of this technology.

leodemoura.github.io
240 236
Summary
Welcoming Elizabeth Barron as the New Executive Director of the PHP Foundation
ulrischa 3 days ago

Welcoming Elizabeth Barron as the New Executive Director of the PHP Foundation

The article announces the appointment of Elizabeth Barron as the new Executive Director of The PHP Foundation, a non-profit organization that supports and promotes the PHP programming language. It highlights Barron's experience and expertise, and outlines her goals to further the foundation's mission and community engagement.

thephp.foundation
29 17
Summary
An Interactive Intro to CRDTs (2023)
evakhoury about 16 hours ago

An Interactive Intro to CRDTs (2023)

The article provides an interactive introduction to Conflict-free Replicated Data Types (CRDTs), which are a class of data structures used for building distributed, real-time applications. It explains the concept of CRDTs, their advantages over traditional approaches, and how they can be used to build collaborative systems.

jakelazaroff.com
150 23
Summary
atarus about 21 hours ago

Launch HN: Cekura (YC F24) – Testing and monitoring for voice and chat AI agents

Hey HN - we're Tarush, Sidhant, and Shashij from Cekura (https://www.cekura.ai). We've been running voice agent simulation for 1.5 years, and recently extended the same infrastructure to chat. Teams use Cekura to simulate real user conversations, stress-test prompts and LLM behavior, and catch regressions before they hit production.

The core problem: you can't manually QA an AI agent. When you ship a new prompt, swap a model, or add a tool, how do you know the agent still behaves correctly across the thousands of ways users might interact with it? Most teams resort to manual spot-checking (doesn't scale), waiting for users to complain (too late), or brittle scripted tests.

Our answer is simulation: synthetic users interact with your agent the way real users do, and LLM-based judges evaluate whether it responded correctly - across the full conversational arc, not just single turns. Three things make this actually work: Scenario generation + real conversation import - Our scenario generation agent bootstraps your test suite from a description of your agent. But real users find paths no generator anticipates, so we also ingest your production conversations and automatically extract test cases from them. Your coverage evolves as your users do.

Mock tool platform - Agents call tools. Running simulations against real APIs is slow and flaky. Our mock tool platform lets you define tool schemas, behavior, and return values so simulations exercise tool selection and decision-making without touching production systems.

Deterministic, structured test cases - LLMs are stochastic. A CI test that passes "most of the time" is useless. Rather than free-form prompts, our evaluators are defined as structured conditional action trees: explicit conditions that trigger specific responses, with support for fixed messages when word-for-word precision matters. This means the synthetic user behaves consistently across runs - same branching logic, same inputs - so a failure is a real regression, not noise.

Cekura also monitors your live agent traffic. The obvious alternative here is a tracing platform like Langfuse or LangSmith - and they're great tools for debugging individual LLM calls. But conversational agents have a different failure mode: the bug isn't in any single turn, it's in how turns relate to each other. Take a verification flow that requires name, date of birth, and phone number before proceeding - if the agent skips asking for DOB and moves on anyway, every individual turn looks fine in isolation. The failure only becomes visible when you evaluate the full session as a unit. Cekura is built around this from the ground up. Where tracing platforms evaluate turn by turn, Cekura evaluates the full session. Imagine a banking agent where the user fails verification in step 1, but the agent hallucinates and proceeds anyway. A turn-based evaluator sees step 3 (address confirmation) and marks it green - the right question was asked. Cekura's judge sees the full transcript and flags the session as failed because verification never succeeded.

Try us out at https://www.cekura.ai - 7-day free trial, no credit card required. Paid plans from $30/month.

We also put together a product video if you'd like to see it in action: https://www.youtube.com/watch?v=n8FFKv1-nMw. The first minute dives into quick onboarding - and if you want to jump straight to the results, skip to 8:40.

Curious what the HN community is doing - how are you testing behavioral regressions in your agents? What failure modes have hurt you most? Happy to dig in below!

84 20
Indefinite Book Club Hiatus
cdrnsf about 7 hours ago

Indefinite Book Club Hiatus

The author announces an indefinite hiatus for their book club due to personal commitments and the need to focus on other projects. They express gratitude to the book club members and their understanding during this break.

whatever.scalzi.com
27 13
Summary
Weave – A language aware merge algorithm based on entities
rs545837 about 9 hours ago

Weave – A language aware merge algorithm based on entities

Weave is an open-source, decentralized communication protocol that enables secure and privacy-preserving messaging between users. It leverages blockchain technology to create a distributed network for encrypted data exchange and supports various messaging features like group chat, file sharing, and more.

github.com
123 86
Summary
1659447091 about 10 hours ago

The largest acidic geyser has been putting on quite a show

The Echinus Geyser in Yellowstone National Park has resumed eruptions after a period of dormancy, according to the U.S. Geological Survey. The geyser, which is one of the park's most predictable geothermal features, has been observed to be actively erupting and drawing the interest of visitors and scientists.

usgs.gov
50 1
Summary
meetpateltech about 17 hours ago

GPT‑5.3 Instant

OpenAI announces the launch of GPT-5.3, a powerful new language model with advanced capabilities in natural language processing, content generation, and task completion.

openai.com
355 277
Summary
eieio about 8 hours ago

Number Research Inc

numberresearch.xyz
35 19
Don't become an engineering manager
flail about 21 hours ago

Don't become an engineering manager

The article cautions against becoming an engineering manager too soon, emphasizing the need for substantial technical experience and leadership skills before making the transition. It highlights the challenges of the role and the importance of understanding one's motivations and priorities before taking on managerial responsibilities.

newsletter.manager.dev
358 255
Summary