I'm reluctant to verify my identity or age for any online services
The article discusses the author's reluctance to verify their identity or age for online services, citing concerns over privacy and data security. It explores the growing trend of identity verification requirements and the potential drawbacks for users.
India's top court angry after junior judge cites fake AI-generated orders
The article discusses the discovery of a new species of dinosaur, Meraxes gigas, which was a large carnivorous dinosaur that lived in what is now Argentina during the Late Cretaceous period. Researchers believe Meraxes had a large head and powerful jaws, and that it may have been a close relative of the iconic Tyrannosaurus rex.
The Xkcd thing, now interactive
My first science video in 3 years (Pysics Girl)
Claude's Cycles: Claude Opus 4.6 solves a problem posed by Don Knuth [pdf]
Meta’s AI smart glasses and data privacy concerns
The article explores the privacy concerns raised by Meta's (formerly Facebook) new AI-powered smart glasses, which can potentially monitor workers' activities and conversations without their explicit consent. Employees express concerns about the lack of transparency and the implications for their privacy and autonomy in the workplace.
Apple Introduces MacBook Pro with All‑New M5 Pro and M5 Max
Apple introduces the new MacBook Pro with the powerful M5 Pro and M5 Max chips, providing enhanced performance and efficiency for users.
British Columbia is permanently adopting daylight time
The Government of British Columbia has announced that the province will be adopting year-round daylight saving time, eliminating the biannual clock changes effective November 2023. This decision aims to provide consistency and reduce the disruption caused by the twice-yearly time shifts.
Arm's Cortex X925: Reaching Desktop Performance
The article discusses ARM's new Cortex-X925 processor, which is designed to challenge Intel and AMD's desktop CPUs. It highlights the Cortex-X925's potential performance improvements and its implications for the desktop computing market.
Launch HN: Cekura (YC F24) – Testing and monitoring for voice and chat AI agents
Hey HN - we're Tarush, Sidhant, and Shashij from Cekura (https://www.cekura.ai). We've been running voice agent simulation for 1.5 years, and recently extended the same infrastructure to chat. Teams use Cekura to simulate real user conversations, stress-test prompts and LLM behavior, and catch regressions before they hit production.
The core problem: you can't manually QA an AI agent. When you ship a new prompt, swap a model, or add a tool, how do you know the agent still behaves correctly across the thousands of ways users might interact with it? Most teams resort to manual spot-checking (doesn't scale), waiting for users to complain (too late), or brittle scripted tests.
Our answer is simulation: synthetic users interact with your agent the way real users do, and LLM-based judges evaluate whether it responded correctly - across the full conversational arc, not just single turns. Three things make this actually work: Scenario generation + real conversation import - Our scenario generation agent bootstraps your test suite from a description of your agent. But real users find paths no generator anticipates, so we also ingest your production conversations and automatically extract test cases from them. Your coverage evolves as your users do.
Mock tool platform - Agents call tools. Running simulations against real APIs is slow and flaky. Our mock tool platform lets you define tool schemas, behavior, and return values so simulations exercise tool selection and decision-making without touching production systems.
Deterministic, structured test cases - LLMs are stochastic. A CI test that passes "most of the time" is useless. Rather than free-form prompts, our evaluators are defined as structured conditional action trees: explicit conditions that trigger specific responses, with support for fixed messages when word-for-word precision matters. This means the synthetic user behaves consistently across runs - same branching logic, same inputs - so a failure is a real regression, not noise.
Cekura also monitors your live agent traffic. The obvious alternative here is a tracing platform like Langfuse or LangSmith - and they're great tools for debugging individual LLM calls. But conversational agents have a different failure mode: the bug isn't in any single turn, it's in how turns relate to each other. Take a verification flow that requires name, date of birth, and phone number before proceeding - if the agent skips asking for DOB and moves on anyway, every individual turn looks fine in isolation. The failure only becomes visible when you evaluate the full session as a unit. Cekura is built around this from the ground up. Where tracing platforms evaluate turn by turn, Cekura evaluates the full session. Imagine a banking agent where the user fails verification in step 1, but the agent hallucinates and proceeds anyway. A turn-based evaluator sees step 3 (address confirmation) and marks it green - the right question was asked. Cekura's judge sees the full transcript and flags the session as failed because verification never succeeded.
Try us out at https://www.cekura.ai - 7-day free trial, no credit card required. Paid plans from $30/month.
We also put together a product video if you'd like to see it in action: https://www.youtube.com/watch?v=n8FFKv1-nMw. The first minute dives into quick onboarding - and if you want to jump straight to the results, skip to 8:40.
Curious what the HN community is doing - how are you testing behavioral regressions in your agents? What failure modes have hurt you most? Happy to dig in below!
I'm losing the SEO battle for my own open source project
Ars Technica fires reporter after AI controversy involving fabricated quotes
Ars Technica, a popular technology news website, has terminated the employment of a reporter after it was discovered that they had been using AI-generated quotes in their articles without properly disclosing this to readers. The incident has sparked discussion about the ethical use of AI in journalism and the importance of transparency.
The Internet's Top Tech Publications Lost 58% of Their Google Traffic Since 2024
This article examines the recent decline of major tech and media companies, exploring factors such as overspending, waning user growth, and economic uncertainty. It highlights the challenges faced by these industries and the potential implications for the broader technology and media landscapes.
Don't Become an Engineering Manager
The article cautions against becoming an engineering manager too soon, emphasizing the need for substantial technical experience and leadership skills before making the transition. It highlights the challenges of the role and the importance of understanding one's motivations and priorities before taking on managerial responsibilities.
Apple introduces the new MacBook Air with M5
Apple introduces the new MacBook Air with the powerful M5 chip, featuring a stunning design, impressive performance, and extended battery life, making it a compelling option for users seeking a lightweight and capable laptop.
Points on a ring: An interactive walkthrough of a popular math problem
The article discusses the importance of having a good ring design for software applications, emphasizing the need for well-defined interfaces, modular architecture, and a clear separation of concerns to ensure scalability, maintainability, and flexibility.
We Built a Video Rendering Engine by Lying to the Browser About What Time It Is
The article discusses how browsers, such as Google Chrome, are increasingly limiting the ability of websites to access users' cameras and microphones without explicit permission. This is part of a broader trend towards increased privacy and security measures in web browsers.
Apple unveils new Studio Display and all-new Studio Display XDR
Apple has unveiled the new Studio Display and Studio Display XDR, offering advanced display technologies and features for professional users. The new displays feature high-resolution Retina displays, premium audio systems, and sophisticated camera and video capabilities to enhance productivity and creativity.
History of the Graphical User Interface: The Rise (and Fall?) Of WIMP Design
The article traces the evolution of graphical user interfaces (GUIs), from the early experiments at Xerox PARC to the development of the Apple Macintosh and Microsoft Windows, highlighting the key innovations and milestones that have shaped the way we interact with computers and digital devices.
Simple screw counter
This article describes a DIY project to build a simple screw counter device using an Arduino and a push button. The project tracks the number of screws driven and displays the count on an OLED display, providing a useful tool for DIY and construction tasks.
Computer Says No
This article explores the implications of relying too heavily on computer systems and algorithms, highlighting how they can sometimes make mistakes or have unforeseen consequences, leading to situations where 'the computer says no' even when it may not be the best outcome.
C64: Putting Sprite Multiplexing to Work
The article discusses the implementation of sprite multiplexing on the Commodore 64 computer, a technique used to display more sprites than the hardware supports by quickly switching between them. It provides a detailed technical explanation of how to implement this technique and the challenges involved.
Show HN: I built a sub-500ms latency voice agent from scratch
I built a voice agent from scratch that averages ~400ms end-to-end latency (phone stop → first syllable). That’s with full STT → LLM → TTS in the loop, clean barge-ins, and no precomputed responses.
What moved the needle:
Voice is a turn-taking problem, not a transcription problem. VAD alone fails; you need semantic end-of-turn detection.
The system reduces to one loop: speaking vs listening. The two transitions - cancel instantly on barge-in, respond instantly on end-of-turn - define the experience.
STT → LLM → TTS must stream. Sequential pipelines are dead on arrival for natural conversation.
TTFT dominates everything. In voice, the first token is the critical path. Groq’s ~80ms TTFT was the single biggest win.
Geography matters more than prompts. Colocate everything or you lose before you start.
GitHub Repo: https://github.com/NickTikhonov/shuo
Follow whatever I next tinker with: https://x.com/nick_tikhonov
Florida public universities to pause hiring new H-1B workers
Florida public universities have paused the hiring of H-1B visa workers, amid concerns over the continued use of the program and its impact on the job market. The move is expected to last through the end of the year as the universities review their practices and policies around the H-1B visa program.
Show HN: React-Kino – Cinematic scroll storytelling for React (1KB core)
I built react-kino because I wanted Apple-style scroll experiences in React without pulling in GSAP (33KB for ScrollTrigger alone).
The core scroll engine is under 1KB gzipped. It uses CSS position: sticky with a spacer div for pinning — same technique as ScrollTrigger but with zero dependencies.
12 declarative components: Scene, Reveal, Parallax, Counter, TextReveal, CompareSlider, VideoScroll, HorizontalScroll, Progress, Marquee, StickyHeader.
SSR-safe, respects prefers-reduced-motion, works with Next.js App Router.
Demo: https://react-kino.dev GitHub: https://github.com/btahir/react-kino npm: npm install react-kino
Disable Your SSH access accidentally with scp
The article discusses the Hypha decentralized cooperative, which aims to build a sustainable and equitable digital infrastructure. It explores the cooperative's mission, values, and the technical components of its decentralized cloud platform.
A [Firefox, Chromium] extension that converts Microsoft to Microslop
The article discusses the Microslop browser extension for Android, which allows users to take screenshots and record screen activity on their devices. The extension provides a range of features, including the ability to annotate screenshots and share them easily.
Mullvad VPN: Banned TV Ad in the Streets of London [video]
I built a pint-sized Macintosh
The article discusses the creation of a 'Pico Micro Mac', a miniature Macintosh computer built using a Raspberry Pi Pico microcontroller. The author highlights the technical details and challenges involved in designing and building this compact, retro-inspired computer.
Why No AI Games?
The article discusses the challenges and limitations of AI-generated games, highlighting the lack of originality, creativity, and emotional resonance compared to human-created games. It emphasizes the importance of human ingenuity and the unique qualities that distinguish human-made games from their AI-generated counterparts.