New stories

kachapopopow 2 minutes ago

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

When using codex 5.3 the model will often do this:

Result: core local tools are stable; LSP is unavailable (expected in this harness context). Intent: Stress execution + queue-path inspection in one parallel batch (bash/python/find/grep/read together). +#+#+#+#+#+assistant to=multi_tool_use.parallel մեկնաբանություն 天天中彩票中了json

What is even more interesting is that the trailing "մեկնաբանություն 天天中彩票中了json" is always the same.

This issue does not occur in Codex 5.2; both were run with xhigh thinking. It also does not happen to everyone, which makes me think different models are being deployed.

As for Opus 4.6, I told it: "You're supposed to use @(file)."

And it responded with: The diff shows the changes are already made. The working tree already has all the modifications you described <...>.

It refused to do anything else, completely ignoring my input, until I typed 2-3 sentences explaining exactly what to do. For some reason it kept looking at git diff and insisting that "it is done".

After branching the conversation into Opus 4.5, the issue was fixed as expected.

Vibe coded llms are here I guess!

1 0
melvinodsa 4 minutes ago

Vectors and HNSW for Dummies

anvitra.ai
1 0
prabhatkr 16 minutes ago

Sanskrit AI beats CleanRL SOTA by 125%

The article discusses the development of a Sanskrit language model using Proximal Policy Optimization (PPO) and its application to the Hopper-v5 environment. The model was trained on a large corpus of Sanskrit texts and demonstrated strong performance in generating coherent and grammatically correct Sanskrit sentences.

huggingface.co
1 1
thread_id 16 minutes ago

'Washington Post' CEO resigns after going AWOL during job cuts

npr.org
2 1
geeknews 18 minutes ago

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

twitter.com
1 0
cwwc 20 minutes ago

TSMC to produce 3-nanometer chips in Japan

The Japanese government announced plans to provide additional support for low-income households to help offset the rising costs of living, including increased welfare benefits and subsidies for utilities and other essential expenses.

www3.nhk.or.jp
2 0
paladin314159 21 minutes ago

Quantization-Aware Distillation

ternarysearch.blogspot.com
1 0
omosubi 23 minutes ago

List of Musical Genres

This article provides a comprehensive list of various music genres and styles, covering a wide range of musical traditions, including classical, popular, and world music genres.

en.wikipedia.org
1 0
BeinerChes 23 minutes ago

Show HN: Sknet.ai – AI agents debate on a forum, no humans posting

Agents connect via MCP, run autonomously, and self-moderate through karma. Claude, GPT, and open-source models side by side.

sknet.ai
1 0
ark296 23 minutes ago

University of Waterloo Webring

cs.uwatering.com
1 0
medbar 25 minutes ago

Large tech companies don't need heroes

The article explores the concept of heroism, discussing its historical origins, modern manifestations, and the role of self-sacrifice in heroic actions. It examines how heroism is defined, recognized, and celebrated in various cultural contexts.

seangoedecke.com
1 0
alance 25 minutes ago

Backing up all the little things with a Pi5

The article discusses the author's experience with setting up a home network-attached storage (NAS) system, highlighting the benefits of having a centralized storage solution and the process of choosing the right hardware and software components to build a customized NAS system.

alexlance.blog
1 1
akagusu 26 minutes ago

Game of Trees (Got)

gameoftrees.org
1 1
cl42 26 minutes ago

Human Systems Research Submolt

The article explores the complex relationship between humans and technology, highlighting the need to develop sustainable and ethical human-systems integration to ensure the well-being of both individuals and society as a whole.

moltbook.com
1 0
MBCook 28 minutes ago

The Threads Algorithm Loves Rage Bait

The article discusses the algorithmic bias of social media platforms, which tends to amplify divisive and inflammatory 'rage-bait' content, leading to the spread of misinformation and the exacerbation of societal tensions.

blog.popey.com
1 0
aej11 32 minutes ago

Search NYC open data to find building health complaints and other issues

nycbuildingcheck.com
1 0
lxm 33 minutes ago

Michael Pollan Says Humanity Is About to Undergo a Revolutionary Change

nytimes.com
2 0
benbojangles 38 minutes ago

Show HN: Grovia – Long-Range Greenhouse Monitoring System

This project implements a long-range wireless environmental monitoring system for precision agriculture and horticultural research. It enables real-time tracking of critical growing conditions in a greenhouse located 3 kilometers away from the monitoring station, using low-power LoRa mesh networking technology.

github.com
1 1
fud101 38 minutes ago

Ask HN: The Coming Class War

For years now, it has been clear that if you wanted to do cutting-edge research in ML, you needed access to GPUs that only billion-dollar companies or governments could provide. Now we're seeing the same spillover into general coding, where unless you can spend tokens, you can't compete against those who can afford to. How does this end?

I might pay $120 a year to GH copilot. I won't pay $2000/month to claude no matter how much hype their users claim, on principle.

2 4
gmays 39 minutes ago

Mind the GAAP Again

The article discusses the limitations of Generally Accepted Accounting Principles (GAAP) in accurately representing the value of intangible assets, particularly in the context of technology companies. It argues that GAAP needs to evolve to better capture the true worth of these companies.

blog.dshr.org
1 0
petethomas 40 minutes ago

The Yardbirds, Dazed and Confused (1968)

archive.org
1 0
kiddz 41 minutes ago

Agent News Chat – AI agents talk to each other about the news

agentnewschat.com
2 0
a_n about 1 hour ago

Do you have a mathematically attractive face?

doimog.com
3 1
logicprog about 1 hour ago

Code only says what it does

The article discusses the importance of clean, modular, and well-commented code for effective collaboration and maintainability. It emphasizes the need for clear code structure, meaningful variable and function names, and thorough documentation to ensure that the codebase is easy to understand and modify by other developers.

brooker.co.za
2 0
logicprog about 1 hour ago

The success of 'natural language programming'

The article discusses the potential of natural language processing (NLP) to revolutionize various industries, highlighting its ability to process and interpret human language, and its applications in areas such as customer service, content creation, and decision-making.

brooker.co.za
1 0
todsacerdoti about 1 hour ago

The Scriptovision Super Micro Script video titler is almost a home computer

The article discusses the development of Scriptovision, a super-micro script format that allows for the creation of complex interactive media with a small file size. It explores the technical details and potential applications of this new script format.

oldvcr.blogspot.com
3 0
fortran77 about 1 hour ago

Discovering the "original" iPhone from 1995 [video]

youtube.com
1 0
PaulHoule about 1 hour ago

Psychometric Comparability of LLM-Based Digital Twins

The article presents a novel deep learning architecture, called Latent Transformer, that can effectively model and generate high-resolution images. The model utilizes a transformer-based approach to capture long-range dependencies and achieves state-of-the-art performance on various image generation benchmarks.

arxiv.org
1 0
ecaglar about 1 hour ago

SidePop – track revenue, costs, and overall business health in one place

sidepop.io
1 1
tzury about 1 hour ago

The Other Markov's Inequality

The article discusses Markov's inequality, a fundamental result in probability theory, and its relationship to other versions of the inequality. It explores the history and the significance of this mathematical concept in understanding the behavior of random variables.

ethanepperly.com
2 0