mccoyb 2 hours ago

Something something medical researcher reinvents calculus.

In 2026: frontend web developer reinvents tmux.

Guys, please do us the service of pre-filtering your crack token dreams by investigating the tool stack which is already available in the terminal ... or at least give us the courtesy of explaining why your vibecoded Greenspun's 10th something is a significant leg up on what already exists, and perhaps has existed for many years, (and is therefore, in the training set, and is therefore, probably going to work perfectly out of the box).

  • cossatot an hour ago

    Maybe, just maybe, this is of obvious utility to the many people who have needs that are not yours?

    I very regularly need to interact with my work through a python interpreter. My work is scientific programming. So the variables might be arrays with millions of elements. In order to debug, optimize, verify, or improve in any way my work, I cannot rely on any other methods than interacting with the code as it's being run, or while everything is still in memory. So if I want to really leverage LLMs, especially to allow them to work semi-autonomously, they must be able to do the same.

    I'm not going to dump tens of GB of stuff to a log file or send it around via pipes or whatever. Why is there a nan in an array that is the product of many earlier steps in a code that took an hour to run? Why are certain data in a 200k-variable system of equations much harder to fit than others, and which equations are in tension with each other to prevent better convergence?

    Are interpreters and pdb not great, previously-existing tools for this kind of work? Does a new tool that lets LLMs/agents use them actually represent some sort of hack job because better solutions have existed for years?

    • mccoyb an hour ago

      See related sibling: the use cases are compelling!

      My complaint is that tmux handles them perfectly. Exactly the claim that OP is making with their software - is served by robust 18 year old software.

      In 2026, it costs nearly nothing to thoroughly and autonomously investigate related software — so yes I am going to be purposefully abrasive about it.

    • joshribakoff an hour ago

      > I'm not going to dump tens of GB of stuff to a log file

      In the same vein as the parent comment, the curiosity is why you would vibe code a solution instead of reaching for grep.

  • dcchuck 2 hours ago

    I agree. We skipped CLIs and went all the way to TUIs because TUIs are "easy to make now"? Or maybe because claude/codex?

    But in practice you are padding token counts of agents reading streams of TUIs instead of leveraging standard unix pipes that have been around from day 1.

    TLDR - your agent wants a CLI anyway.

    Disclaimer: still a cool project and thank you to the author for sharing.

    • joshribakoff an hour ago

      The TUI makes more sense to humans who don’t understand the difference between a human and a machine.

8note 24 minutes ago

hehe, i made something similar for feedback loop on claude hooks. claude can open another claude instance in the testing folder, and check to see if the hooks fire properly

petcat 2 hours ago

Maybe I'll use this to feed prompts into an interactive Claude session so I can use my max subscription instead of having to pay for API credits when using claude -p

wolttam 2 hours ago

This is kind of fun, something I've been thinking about over the last couple days.

This is one area that makes me feel like our current LLM approach is just not quite general enough.

Yes, developers and power users love the command-line, because it is the most efficient way to accomplish many tasks. But it's rarely (never?) our only tool. We often reach for TUIs and GUIs.

It's why approaches like this get me excited: https://si.inc/posts/fdm1/

wild_egg 2 hours ago

I've had my agents using tmux for these use cases for a couple years now. What does TUI-use offer on top?

mikkupikku an hour ago

Are they any good at nethack?