Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?
When using codex 5.3 the model will often do this:
Result: core local tools are stable; LSP is unavailable (expected in this harness context). Intent: Stress execution + queue-path inspection in one parallel batch (bash/python/find/grep/read together). +#+#+#+#+#+assistant to=multi_tool_use.parallel մեկնաբանություն 天天中彩票中了json
What is even more interesting is that the trailing "մեկնաբանություն 天天中彩票中了json" is always the same.
This issue does not occur in codex 5.2; both were run with xhigh thinking. It also does not happen to everyone, which makes me think there are different models being deployed.
As for opus 4.6: You're supposed to use @(file).
And it responded with: The diff shows the changes are already made. The working tree already has all the modifications you described <...>.
It refused to do anything else, completely ignoring my input, until I typed 2-3 sentences explaining exactly what to do. For some reason it kept looking at git diff and insisting that "it is done".
After branching the conversation into opus 4.5, the issue was fixed as expected.
Vibe-coded LLMs are here, I guess!
Vectors and HNSW for Dummies
Sanskrit AI beats CleanRL SOTA by 125%
The article discusses the development of a Sanskrit language model using Proximal Policy Optimization (PPO) and its application to the Hopper-v5 environment. The model was trained on a large corpus of Sanskrit texts and demonstrated strong performance in generating coherent and grammatically correct Sanskrit sentences.
'Washington Post' CEO resigns after going AWOL during job cuts
Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive
TSMC to produce 3-nanometer chips in Japan
The Japanese government announced plans to provide additional support for low-income households to help offset the rising costs of living, including increased welfare benefits and subsidies for utilities and other essential expenses.
Quantization-Aware Distillation
List of Musical Genres
This article provides a comprehensive list of various music genres and styles, covering a wide range of musical traditions, including classical, popular, and world music genres.
Show HN: Sknet.ai – AI agents debate on a forum, no humans posting
Agents connect via MCP, run autonomously, and self-moderate through karma. Claude, GPT, and open-source models side by side.
University of Waterloo Webring
Large tech companies don't need heroes
The article explores the concept of heroism, discussing its historical origins, modern manifestations, and the role of self-sacrifice in heroic actions. It examines how heroism is defined, recognized, and celebrated in various cultural contexts.
Backing up all the little things with a Pi5
The article discusses the author's experience with setting up a home network-attached storage (NAS) system, highlighting the benefits of having a centralized storage solution and the process of choosing the right hardware and software components to build a customized NAS system.
Game of Trees (Got)
Human Systems Research Submolt
The article explores the complex relationship between humans and technology, highlighting the need to develop sustainable and ethical human-systems integration to ensure the well-being of both individuals and society as a whole.
The Threads Algorithm Loves Rage Bait
The article discusses the algorithmic bias of social media platforms, which tends to amplify divisive and inflammatory 'rage-bait' content, leading to the spread of misinformation and the exacerbation of societal tensions.
Search NYC open data to find building health complaints and other issues
Michael Pollan Says Humanity Is About to Undergo a Revolutionary Change
Show HN: Grovia – Long-Range Greenhouse Monitoring System
This project implements a long-range wireless environmental monitoring system for precision agriculture and horticultural research. It enables real-time tracking of critical growing conditions in a greenhouse located 3 kilometers away from the monitoring station, using low-power LoRa mesh networking technology.
Ask HN: The Coming Class War
For years now, it has been clear that if you wanted to do cutting-edge research in ML, you needed access to GPUs that could only be provided by billion-dollar companies or governments. Now we're seeing the same spillover to general coding, where unless you can spend tokens, you can't compete against those who can afford to. How does this end?
I might pay $120 a year to GH copilot. I won't pay $2000/month to claude no matter how much hype their users claim, on principle.
Mind the GAAP Again
The article discusses the limitations of Generally Accepted Accounting Principles (GAAP) in accurately representing the value of intangible assets, particularly in the context of technology companies. It argues that GAAP needs to evolve to better capture the true worth of these companies.
The Yardbirds, Dazed and Confused (1968)
Agent News Chat – AI agents talk to each other about the news
Do you have a mathematically attractive face?
Code only says what it does
The article discusses the importance of clean, modular, and well-commented code for effective collaboration and maintainability. It emphasizes the need for clear code structure, meaningful variable and function names, and thorough documentation to ensure that the codebase is easy to understand and modify by other developers.
The success of 'natural language programming'
The article discusses the potential of natural language processing (NLP) to revolutionize various industries, highlighting its ability to process and interpret human language, and its applications in areas such as customer service, content creation, and decision-making.
The Scriptovision Super Micro Script video titler is almost a home computer
The article discusses the development of Scriptovision, a super-micro script format that allows for the creation of complex interactive media with a small file size. It explores the technical details and potential applications of this new script format.
Discovering the "original" iPhone from 1995 [video]
Psychometric Comparability of LLM-Based Digital Twins
The article presents a novel deep learning architecture, called Latent Transformer, that can effectively model and generate high-resolution images. The model utilizes a transformer-based approach to capture long-range dependencies and achieves state-of-the-art performance on various image generation benchmarks.
SidePop – track revenue, costs, and overall business health in one place
The Other Markov's Inequality
The article discusses Markov's inequality, a fundamental result in probability theory, and its relationship to other versions of the inequality. It explores the history and the significance of this mathematical concept in understanding the behavior of random variables.