- Joined
- Dec 17, 2019
Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
I like the distributed model. I can start it on any one of my systems then control it from my desktop. I also wrote a little script that watches the queue on one node and ships jobs to other nodes when they are idle.Hopefully it'll become truly standalone in the near future, without having to do this ugly dichotomy of a command line and a browser tab to use it.
Turns out that there are already works on a desktop version with it's own Electron framework. I tried migrating to it but it's a goddamn mess, so for the time being a good alternative for a clean experience for me is just running ComfyUI as a service via NSSM and using the PWA functionality of my browser. I have so much RAM and VRAM to spare it won't interfere with my day-to-day use.I like the distributed model. I can start it on any one of my systems then control it from my desktop. I also wrote a little script that watches the queue on one node and ships jobs to other nodes when they are idle.
Any info about whether or not it's possible to run it on consumer GPU's? They recommend 80GB GPU's which only exist in the enterprise hardware sphere, or the home lunatic sphere where you NVLink four 3090's together.
You can run Hunyuan on as low as 8-12gb with specific workflows, it seems like you need at least 24 for the img2video model at the moment but I expect that to go down as people do their magic
Probably this lora or something similar: https://civitai.com/models/1346623/360-degree-rotation-microwave-rotation-wan21-i2v-loraHow was this made? I would love to know what was used to make this turntable video.
Grok AI from twitter apparently. Here is a main thread https://kiwifarms.st/threads/the-studio-ghibli-meme-gallery.215562/What is being used for the studio ghibli style images that I am seeing on twitter today? Some are saying it's GPT 4.0 + a source image. Others are saying this is being blocked already.
Grok might be used for a couple of these, but the majority are made with ChatGPT's new 4o image generation.Grok AI from twitter apparently. Here is a main thread https://kiwifarms.st/threads/the-studio-ghibli-meme-gallery.215562/
I wasn't able to generate that image because the subject appears to be a recognizable real person, and I need to follow guidelines that protect individuals' privacy and identity. If you have another idea or request—like an original character, a concept, or a stylized version of a different scene—I’d be happy to help!
The silver lining is that Chinese firms now have a firm target to beat, 4o native image gen appears to be a multimodal autoregression model (tokenizing each pixel and doing reasoning in pixel space) while FOSS solutions used diffusion and write always ahead of autoregression in terms of resolution and quality.Unfortunately, as always happens with closed source models, OpenAI is already nerfing it.
I had a good experience with fireworks.ai's playground showcase of Deepseek V3 03-24 yesterday. At this point chyna's open model weights can be hosted by API providers cheaply enough that it's free for casual use, almost free for everything else, and still performs pretty close to cutting edge. Fingers crossed that they keep not having to care about pressures to censor.And they're also usually far larger than diffusion models so good luck running those locally, best case scenario a model is open sourced and you get to choose from a selection of providers that run it uncensored.
Hard to say because their exact method is unknown. Back in gpt-4o-vision that would just make calls to DALL-E, a 512x512 tile would be about 6k tokens, and that's also a regression model. It could probably be tens of thousands.How many tokens do you think are in a 512x512 image for performance like 4o is showing, by the way? For a ballpark estimate of how much this will cost if API services with the model weights charge by the token.
thats just a jeep wrangler basically
I really like the Todd one.My newest hobby since the new OpenAi Img gen is to feed it random images and turn it into 40k.
View attachment 7158179View attachment 7158180
View attachment 7158181
View attachment 7158182
View attachment 7158186
View attachment 7158184
The new model is rolling out to paid users starting today.