this post was submitted on 10 Sep 2023

5 points (100.0% liked)

Creative

4280 readers

3 users here now

Beehaw's section for your art and original content, other miscellaneous creative works you've found, and discussion of the creative arts and how they happen generally. Covers everything from digital to physical; photography to painting; abstract to photorealistic; and everything in between.

(It's not mandatory, but we also encourage providing a description of your image(s) for accessibility purposes! See here for a more detailed explanation and advice on how best to do this.)

Subcommunities on Beehaw:

Writing

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago

MODERATORS

alyaza@beehaw.org

Kamirose@beehaw.org

remington@beehaw.org

City Guardian (AI assisted) (i.imgur.com)

submitted 1 year ago by SpeakingColors@beehaw.org to c/creative@beehaw.org

13 comments fedilink hide all child comments

I’ve been diving into AI assisted workflows and found an extreme font for creativity. My recent efforts have been towards RPG-style characters like you’d see in a D&D game, and this guy came from the idea of a royal guard of an ancient city, Egyptian/African-esque. The AI gave me a variation with just the shield and I really liked the aspect of not killing but defending. If anyone is curious about the workflow I’d be happy to share :)

top 13 comments

sorted by: hot top controversial new old

[–] novibe@lemmy.ml 1 points 1 year ago (1 children)

Just curious what the “assistance” part of things was?

[–] SpeakingColors@beehaw.org 2 points 1 year ago

For sure! Often I’ll come in with a visual idea already, or will iterate on some with the AI giving inspiration. If I have the idea strongly I’ll sketch out the composition and elements I know I want - sometimes on real tricky poses like fingers I’ll take a photo of myself doing them. Throw that into stable diffusion with img-2-img to generate images based on my sketch/photograph to something more full featured or something I hadn’t thought of but really like (you can also set how “dreamy” the AI should be, how much it should vary from the input material).

There’s a lot of detail I could get into but the “assistance” is fleshing out a composition -> I go in and correct anatomical mistakes or elements I want to change specifically -> run it through again if it needs it.

[–] BuckShot686@beehaw.org 1 points 1 year ago* (last edited 1 year ago)

It looks amazing, I just can't shake how much the characters face resembles the lyricist Ryan Caraveo!

Edit: Reference photo & song: https://yewtu.be/watch?v=xikczrPm_1Q&listen=false

[–] kat@feddit.de 0 points 1 year ago (1 children)

I'm a little bit conflicted honestly, because I'm not a fan of the idea of A.I. art, in general. But I also have to admit this result looks really good. Very cool character and perspective! I also like your take with him being a defender/carrying no weapon.

I think maybe it needs some more work in the surroundings, because some of the buildings don't make sense (missing walls e.g. around his shield).

[–] SpeakingColors@beehaw.org 1 points 1 year ago

I hear you, when this stuff was blowing up I couldn’t shake that it was trained off artists’ work that they didn’t consent to having in the datasets. Sure it’s similar to how human artists work (for music and art the prevailing recommendations for me, or any artist, was to consume material relevant to your art. For visual art they really just wanted you to constantly keep your head open for shapes and form) but it felt closer to plagiarism than inspiration. Some generations can be very close to an individual style (especially if the model was trained specifically off that) but I found that generations that omitted an artist ended up creating something compelling but not tied to one artist specifically - still undoubtedly a conglomeration of the multitudes it was trained on (including photography). It’s muddy water for sure, and the angle of AI replacing workers in general is still relevant - but I also think it empowers people like me who have the visual ideas but can use the help making them fully fleshed out.

The crux, for me, feels like “when you can see whatever you want, what do you want to see?” A lot of our AI woes are reflections of questionable human behavior (racist chat models, AI for war, deepfakes and dishonesty).

How do you feel about it?

[–] Hundun@beehaw.org 0 points 1 year ago (1 children)

I've been meaning to get into AI-assisted graphic workflows, but haven't found anything useful aside from basic tutorials on how to set up SD and use it with basic prompts.

Can you share some leaning resources, perhaps a workflow one could steal?

[–] SpeakingColors@beehaw.org 0 points 1 year ago (1 children)

I replied to a previous comment about the “assistance” part which is sorta an abridged version of my workflow (“workflow” is also a term used in Comfy UI, a visual layout that processes the image sequentially through modules). It’s super fun I highly recommend it! Feel free to PM me anytime I’d be glad to help!

Really it was looking up terms and areas of Automatic 1111 I was unsure of and finding various sites and guides. Civitai has LOTS of guides often written by model makers or people with lots of hours in the field - it’s also my main resource for LoRAs and Models. But there’s tons of info on there. The most helpful ones where settings and workflows on actual image generation (I can definitely find some links for you there) to get quality results without too much “and if I change this, what happens?” But honestly I love poking around like that so I still spend hours tweaking just to see what happens xD

[–] Hundun@beehaw.org 1 points 1 year ago

Thank you for answering!

[–] Anamana@feddit.de 0 points 1 year ago (1 children)

Looks nice & cool concept. What's the workflow? :)

[–] SpeakingColors@beehaw.org 0 points 1 year ago (1 children)

Thank you! Essentially I’ll come in with a visual idea, some sketches already or I’ll do one with AI in mind (keep the lines simple so it doesn’t get confused). Generate a batch of images with img-2-img and cherry pick the ones that fit closest to the idea or are surprising and wonderful. Rework those for anatomical errors or other things I want to fix or omit -> send it back through img-2-img if it needs it or to inject detail -> upscale and put it as my desktop/phone wallpaper :P

(I’m using Automatic 1111 which is a webui for Stable Diffusion btw)

[–] Anamana@feddit.de 0 points 1 year ago (1 children)

Oh img2img sounds awesome. Never heard about it before. How do you rework the anatomical errors and stuff? By hand? Or all within Automatic 1111?

[–] SpeakingColors@beehaw.org 0 points 1 year ago (1 children)

Img2img is one of many ways to constrain the AIs efforts to your compositional desires, it’s rad. You can control the amount of “dreaming” the AI does on the base image to get subtle changes, or a radically different image based on the elements of the previous (sometimes to trippy cool results, often to horrendous mutations if the desired image is supposed to be humanoid xD).

Inpainting is another tool, it’s like a precise img2img on an area you mask. Hands are often the most garbled thing from the AI, so a brute force technique is to img2img the hands - but the process works a lot better if you help the AI out and manually fix the hands. So I’ll throw the image into photoshop, make a list (if I remember :P) of everything I need to fix, address them directly and then toss it back into Automatic 1111. Often the shading and overall style are hard things for me to get right so I’ll inpaint over my edits to get the style and shading back.

[–] Anamana@feddit.de 1 points 1 year ago

Thanks for explaining! sounds like a fun workflow :)