Hacker News — vinext + Cloudflare Workers


▲Show HN: Imagent – agentic image/video/speech generation (github.com)

4 points by unliftedq 13 hours ago | 2 comments

ankurchrungoo 7 hours ago [-]

I have been working on something similar, though also quite different, and I find this interesting. Do you foresee the cost of video generation, for example, being managed by users through their own API keys configured for the different platforms?

unliftedq 1 hour ago [-]

I built this tool to solve my own pain points with agentic automation:

1. Existing CLI solutions are provider-specific (minimax cli, chatgpt cli, etc.), and other providers have no built-in CLI support at all.

2. With local CLIs or scripts, generation results and history are not tracked. When I wanted to generate a similar image, I had to keep the prompts in a notebook. Now, with imagent, I can simply remix any prompt from the library.

3. A CLI is the best fit for agent automation. I can use it to generate slides, blog illustrations, website assets, etc. With it, I can even generate videos with hyperframes/remotions, complete with great illustrations and speech audio. It is all done by the agent; I don't need to create the images or audio myself.

4. The agent isn't aware of the differences and limitations of different models. By maintaining catalogs, the tool lets the agent discover what is available, choose the best option for its needs, and call it through a unified interface.
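
To make points 2 and 4 concrete, here is a minimal Python sketch of a model catalog with a unified generate call and a remixable history. It is not imagent's actual code or API: every name in it (`ModelEntry`, `Catalog`, `fake_adapter`, the model and provider names, the `cost_rank` and `max_prompt_chars` fields) is a hypothetical stand-in, and the adapter only returns a fake output path instead of calling a real provider.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class ModelEntry:
    """One catalog row describing a model (all fields are hypothetical)."""
    name: str
    provider: str
    modality: str          # "image", "video", or "speech"
    max_prompt_chars: int  # illustrative limitation the agent cannot guess
    cost_rank: int         # lower means cheaper; illustrative ordering only


# Every provider adapter shares one signature: (model name, prompt) -> output path.
Adapter = Callable[[str, str], str]


def fake_adapter(model: str, prompt: str) -> str:
    # Stand-in for a real provider call; returns a deterministic fake path.
    return f"out/{model}-{len(prompt)}.bin"


class Catalog:
    def __init__(self) -> None:
        self._entries: dict[str, ModelEntry] = {}
        self._adapters: dict[str, Adapter] = {}
        self.history: list[dict[str, str]] = []

    def register(self, entry: ModelEntry, adapter: Adapter) -> None:
        self._entries[entry.name] = entry
        self._adapters[entry.provider] = adapter

    def discover(self, modality: str) -> list[ModelEntry]:
        """List the models for a modality, cheapest first."""
        found = [e for e in self._entries.values() if e.modality == modality]
        return sorted(found, key=lambda e: e.cost_rank)

    def choose(self, modality: str, prompt: str) -> ModelEntry:
        """Pick the cheapest model whose prompt limit fits the request."""
        for entry in self.discover(modality):
            if len(prompt) <= entry.max_prompt_chars:
                return entry
        raise LookupError(f"no {modality} model accepts this prompt")

    def generate(self, modality: str, prompt: str) -> str:
        """Unified entry point: choose a model, call its adapter, record the run."""
        entry = self.choose(modality, prompt)
        result = self._adapters[entry.provider](entry.name, prompt)
        self.history.append(
            {"model": entry.name, "prompt": prompt, "result": result}
        )
        return result

    def remix(self, index: int, extra: str) -> str:
        """Reuse a past prompt from the history with an added instruction."""
        past = self.history[index]
        modality = self._entries[past["model"]].modality
        return self.generate(modality, f"{past['prompt']}, {extra}")


# Hypothetical catalog contents, for illustration only.
catalog = Catalog()
catalog.register(ModelEntry("img-small", "provider-a", "image", 50, 1), fake_adapter)
catalog.register(ModelEntry("img-large", "provider-b", "image", 2000, 2), fake_adapter)
catalog.register(ModelEntry("vid-basic", "provider-a", "video", 500, 1), fake_adapter)
```

With this shape, an agent would only call `catalog.discover("image")` to see its options and `catalog.generate("image", prompt)` to produce something, and a later `catalog.remix(0, "at sunset")` would rerun the first recorded prompt with a tweak. Picking the cheapest model that fits is just one possible selection rule; a real tool may weigh quality, resolution, or latency instead.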

So the cost of video generation is not my focus at this point; what I care about is automation. If we give the agent this ability, what can it create for us automatically? Not vibe code, but vibe creation.
