Stable Diffusion, NovelAI, Machine Learning Art - AI art generation discussion and image dump

  • 🐕 I am attempting to get the site runnning as fast as possible. If you are experiencing slow page load times, please report it.
Were you looking at roughly the same generation times for Noob and Illustrious when compared to Pony?
Yes; the only variation in generation times were from the sampler used, steps chosen and resolution as neither are turbo models or use the LCM sampler. Wasn't really testing for generation time or paying much attention to it but if there were an outlier I likely would have noticed.
 
  • Like
Reactions: Baraadmirer
I've been playing around with PixAI and Bing to generate a few random characters I thought up. The titles don't really mean anything, I just thought they sounded cool







These ain't too bad, sucks that it's practically a paid service. I wonder how many more years it's gonna take to get fingers right though lol.

Still, this is probably the best anime art generator I've seen, aside from maybe that one Chinese service where you turn real pictures into anime art, but that restricted use from people outside China or something like that. Then again, I haven't tried very many.
 
Illustrious (and to a lesser extent, NoobXL) seems to be a competitor for PonyXL now. Anyone try it out yet?
Impressively good at mixing concepts, works great without the need for any easynegative embeds and has a fairly wide mix of styles outside of anime despite what the images suggest. It even creates readable text sometimes, but not as well as Flux. I see no need to return to Pony, it's a direct upgrade.
 
Impressively good at mixing concepts, works great without the need for any easynegative embeds and has a fairly wide mix of styles outside of anime despite what the images suggest. It even creates readable text sometimes, but not as well as Flux. I see no need to return to Pony, it's a direct upgrade.
Very interesting. I've been trying to branch out with LoRA training, so I've had my eyes set on working with Pony as that appears to be what most people are still using. Gonna consider doing it for Illustrious...
 
Haven’t checked this thread in a long time. The last time I was active here, I believe some folks were asking me why it was taking my 4090 such a long time to train LoRA. I never got around to testing and providing any additional details. I’ll be sure to do so when I get the chance.

Illustrious (and to a lesser extent, NoobXL) seems to be a competitor for PonyXL now. Anyone try it out yet?
I’m really enjoying the Illustrious models. One thing they do really well out of the box is getting your subject to hold an item. With minimal generations, I was able to get some bad ass armor-clad knights to swing around huge war hammers.

As far as NoobXL and all of that VPred stuff goes, have any of you had good results with it? I haven’t been impressed by what I’ve been able to generate. Everything shares this kind of blurry and washed out quality.
 
Pony was only good in spite of the creator. Pony was trained weirdly with obfuscated artist tags causing a bunch of collisions and that's why your required to use stlye LoRAs, and the CLIP is toasted because the author wanted it trained faster, so Pony being as good as it is while also being completely scuffed is a sign of how far SDXL can actually go. Illustrious is an example of that, its understanding of concepts is actually incredible and you can do so much with it without needing a LoRA. You can get such a clean result with a very low sample count, a result that looks like it was run through a refiner, but its just the base model.
 
As far as NoobXL and all of that VPred stuff goes, have any of you had good results with it? I haven’t been impressed by what I’ve been able to generate. Everything shares this kind of blurry and washed out quality.
Are you using the right settings? It only works right if you're using Euler as the sampler and the number of steps and CFG have to be in a pretty narrow range too.
 
Are you using the right settings? It only works right if you're using Euler as the sampler and the number of steps and CFG have to be in a pretty narrow range too.
Correct, and have tried the various schedulers as well. Just not a pleasing result to me. Do you like the end results you’re receiving?
 
Correct, and have tried the various schedulers as well. Just not a pleasing result to me. Do you like the end results you’re receiving?
After messing around a bit I see it too. It seems like it's trained so heavily and drawings it can't handle trying to do photorealism the way other models can, which is a shame but it still makes some pretty good results so long as you play to its strengths and don't look at it too closely. If it's still looking weird regardless of what the prompt is using the default SDXL VAE instead of the built-in one might help.
 
I've been enjoying illustrious_pencil-XL a good fair while using the SwarmUI over Forge. Swarm seems to just keep pushing me to learn the tangled web that is Comfy UI since it offers that along with a Automatic1111-like interface. I recommend people check that one out for swapping between the two types of UI
 
Is there a SDXL software package with a decent mobile webui GUI? All of them are terrible on phones.
 
Is there a SDXL software package with a decent mobile webui GUI? All of them are terrible on phones.
Have you looked into Lobe? I ran it alongside Forge for a while. It provided a decent alternative, but it seems to lack frequent updates. It compartmentalized a lot of crap into sidebars that you could swipe left or right from edges to bring forward, like checkpoints and LoRAs, etc., and it’s highly customizable. Downside, for me, was that it made my phone run extremely hot when using and it would frequently refresh (latest iPhone and iOS version). It would also drain battery quickly because of it which was enough for me to revert back to the ugly and cumbersome A1111/Forge mobile UI.
 
How the hell are you running Stable Diffusion on a phone?
You run it on a machine with a GPU. Have it listen on 0.0.0.0, and then connect to it on your network or over the Internet using a domain. Cloudflare tunnels make it super easy to do that.

I have this setup with all of my ML apps. Including my KoboldCPP instance. I run an IDE with Continue, so I need a Gemma 2 instance for remote usage. Just be sure to secure it with an API key.
 
Would anybody be kind enough to update me on the last eight months in the world of AI image generation? The state of things when I checked out of playing around with this stuff was:
  • SDXL was more or less the standard though lots of people still playing around with SD1.5. There were lots of LORAs that could be used with them and a thriving world of custom checkpoints.
  • SD3 had been available via the API (which I used and wasn't bad imo) and its cut down public release which was derided from all quarters. There were next to no custom checkpoints of home grown LORAs.
  • Flux has just been released both the API version and the two public versions. I played around with these a little in ComfyUI and was impressed. But there were no custom checkpoints or LORAs. In fact, I think it was still being debated how possible those were.

And that's more or less where I drifted away from the scene. I want to fire up Runpod and make some art for a project. I'm not really sure where to start. I went on Civitai and it now seems there are LoRAs for Flux D (?) and there seem to be some customized models as well?

Really just trying to get a feel for what's popular and most supported by the community right now. I plan to use Runpod and am more interested in quality than anything else - i.e. it's no problem if the model requires 32GB of VRAM or something if I'm just paying a couple of dollars for a few hours. But I don't want to start down the wrong track if suddenly SD3 has caught up and is the new SDXL, or if I should be using some particular popular Flux model or whatever.

Appreciate any pointing in the right direction. Currently trying to just figure out the new normal from reading and browsing but you know what a big area this is.
 
Would anybody be kind enough to update me on the last eight months in the world of AI image generation? The state of things when I checked out of playing around with this stuff was:
  • SDXL was more or less the standard though lots of people still playing around with SD1.5. There were lots of LORAs that could be used with them and a thriving world of custom checkpoints.
  • SD3 had been available via the API (which I used and wasn't bad imo) and its cut down public release which was derided from all quarters. There were next to no custom checkpoints of home grown LORAs.
  • Flux has just been released both the API version and the two public versions. I played around with these a little in ComfyUI and was impressed. But there were no custom checkpoints or LORAs. In fact, I think it was still being debated how possible those were.

And that's more or less where I drifted away from the scene. I want to fire up Runpod and make some art for a project. I'm not really sure where to start. I went on Civitai and it now seems there are LoRAs for Flux D (?) and there seem to be some customized models as well?

Really just trying to get a feel for what's popular and most supported by the community right now. I plan to use Runpod and am more interested in quality than anything else - i.e. it's no problem if the model requires 32GB of VRAM or something if I'm just paying a couple of dollars for a few hours. But I don't want to start down the wrong track if suddenly SD3 has caught up and is the new SDXL, or if I should be using some particular popular Flux model or whatever.

Appreciate any pointing in the right direction. Currently trying to just figure out the new normal from reading and browsing but you know what a big area this is.

From what I can tell Pony XL's still very popular, though I suspect more people will move onto Illustrious (also SDXL-based) or Flux once it's easier to generate bobs and vagene.

I experimented a little with Pony-derived models last night: is there some model unloading that goes on once image generation is done? When I generate an image, it causes my computer to slow down (probably from sucking up RAM from loading up the desired model), but if I continue to generate images it seems to go faster without much slowdown.
 
Pretty much what @Baraadmirer posted. Flux is still really neat but limited. I, personally, don’t generate a lot of smut so being able to use GGUFs and prompt naturally feels awesome. Flux is great ant creating surreal and beautiful landscapes, which is my primary application.
Illustrious is super powerful out of the box and is definitely worth tinkering with. Very anime heavy, but it is also very sane. Like mentioned above, I can easily prompt to have a ln armor-clad dude swinging a war hammer by just describing what’s on my mind. Most of the models I’ve tried handle the generation of intricate details, hands and facial features quite well. I’m impressed. I haven’t touched any PonyXL stuff since September.

I highly recommend the AI Brews newsletter (AI Brews Substack). It’s a brief and bite-sized digest that comes in once a week that highlights the happenings of the AI world.

Edit: re: model unloading. I experience this in Comfy, but not A1111/Forge.
 
If anyone's looking for an image upscaler/resizer for LoRA training dataset curation, this service seems to be quite helpful. Only annoying thing is that if you're trying to do batch upscaling, it'll fix all the images to the first one's aspect ratio if you decide to check "lock aspect ratio".
 
  • Informative
Reactions: ddddank
From what I can tell Pony XL's still very popular, though I suspect more people will move onto Illustrious (also SDXL-based) or Flux once it's easier to generate bobs and vagene.

I experimented a little with Pony-derived models last night: is there some model unloading that goes on once image generation is done? When I generate an image, it causes my computer to slow down (probably from sucking up RAM from loading up the desired model), but if I continue to generate images it seems to go faster without much slowdown.

Pretty much what @Baraadmirer posted. Flux is still really neat but limited. I, personally, don’t generate a lot of smut so being able to use GGUFs and prompt naturally feels awesome. Flux is great ant creating surreal and beautiful landscapes, which is my primary application.
Illustrious is super powerful out of the box and is definitely worth tinkering with. Very anime heavy, but it is also very sane. Like mentioned above, I can easily prompt to have a ln armor-clad dude swinging a war hammer by just describing what’s on my mind. Most of the models I’ve tried handle the generation of intricate details, hands and facial features quite well. I’m impressed. I haven’t touched any PonyXL stuff since September.

I highly recommend the AI Brews newsletter (AI Brews Substack). It’s a brief and bite-sized digest that comes in once a week that highlights the happenings of the AI world.

Edit: re: model unloading. I experience this in Comfy, but not A1111/Forge.
Thanks both. I'm honestly a little disappointed that things haven't taken off more. I guess I'm mainly still just going back to SDXL, then. I'd kind of hoped we'd had another leap forward as Flux seemed pretty cool when it was released. Well, is pretty cool - but still lacking the sort of community support SDXL has. And SD3 doesn't appear to be taking off very well from what I can see.

I've never used Pony because although it's apparently one of the most developed models it seems to be primarily focused around porn. And even when not, still seems quite based in that. I checked out Illustrious, thanks for the recommendation, but it's too Anime for what I want. That substack is a great read though - thanks a lot for that.
 
  • DRINK!
Reactions: Looney Troons
I come back every few weeks/months, am in awe what is possible and then proceed to have absolutely no usecase whatsoever for it. I wish I had this stuff twenty years ago, when I needed graphics assets for vidya I was writing. It would have been a major motivator to actually continue.
 
  • Agree
Reactions: ddddank
Back