Big ask.
Personally, I’d be overjoyed to just have embeddings available on the site.
It is good that you ask :)!
Read this: https://arxiv.org/abs/2406.02965
The tldr:
Negatives should be ‘things that appear in the image’.
If you prompt a picture of a cat, then ‘cat’ or ‘pet’ can be useful items to place in the negative prompt.
The best elimination for adjective words is with a 16.7% delay: write [ : neg1 neg2 neg3 :0.167 ] in the negative prompt instead of neg1 neg2 neg3.
The best elimination for noun words is with a 30% delay: write [ : neg1 neg2 neg3 :0.3 ] in the negative prompt instead of neg1 neg2 neg3.
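So for the cat example above (nouns, hence the 30% delay), the negative prompt would be written as [ : cat pet :0.3 ] instead of just cat pet (this assumes Automatic1111-style prompt-editing syntax, where the bracketed part only switches on after that fraction of the steps).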
I appreciate that you took the time to write a sincere question.
Kinda rude for people to downvote you.
Simple and cool.
Florence 2 image captioning sounds interesting to use.
Do people know of any other image-to-text models (apart from CLIP)?
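For reference, here’s a rough sketch of what running Florence 2 captioning through Hugging Face transformers looks like. The model id and the "<CAPTION>" task prompt are from memory of the model card, so double-check them there:

```python
# Rough sketch of Florence 2 captioning via transformers.
# Model id and "<CAPTION>" task prompt are from memory of the model card -- verify there.
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Florence-2-large"
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

image = Image.open("example.png")  # any local image
inputs = processor(text="<CAPTION>", images=image, return_tensors="pt")
generated = model.generate(
    input_ids=inputs["input_ids"],
    pixel_values=inputs["pixel_values"],
    max_new_tokens=128,
)
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```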
Wow, yeah, I found a demo here: https://huggingface.co/spaces/Qwen/Qwen2.5
A whole host of LLMs seems to have been released. Thanks for the tip!
I’ll see if I can turn them into something useful 👍
That’s good to know. I’ll try them out. Thanks.
Hmm. I mean, the FLUX model looks good, so maybe there is some magic in the T5?
I have no clue, so any insights are welcome.
T5 Huggingface: https://huggingface.co/docs/transformers/model_doc/t5
T5 paper: https://arxiv.org/pdf/1910.10683
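If it helps, here is a minimal sketch of what FLUX-style pipelines do with T5: just pulling per-token text embeddings out of the encoder. "t5-small" is used purely for illustration; FLUX conditions on a much larger T5 variant:

```python
# Minimal sketch: get text embeddings from a T5 encoder with transformers.
# "t5-small" is just for illustration; FLUX uses a much larger T5.
from transformers import T5EncoderModel, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
encoder = T5EncoderModel.from_pretrained("t5-small")

tokens = tokenizer("a cat wearing a tiny wizard hat", return_tensors="pt")
embeddings = encoder(**tokens).last_hidden_state  # shape: [1, seq_len, hidden_dim]
print(embeddings.shape)
```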
Any suggestions on what LLM I ought to use instead of T5?
Good find! Fixed. Much appreciated.
Fair enough
I get it. I hope you don’t interpret this as arguing against the results, etc.
What I want to say is:
If implemented correctly, the same seed does give the same output for a given prompt.
If there is variation, then something in the pipeline must be approximating things.
This may be good (for performance), or it may be bad.
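As a tiny illustration of the "same seed, same starting noise" point (plain PyTorch, nothing perchance-specific):

```python
# Same seed -> identical starting noise; any variation has to come from later pipeline steps.
import torch

def noise_from_seed(seed):
    g = torch.Generator().manual_seed(seed)
    return torch.randn(1, 4, 64, 64, generator=g)  # SD-style latent shape

print(torch.equal(noise_from_seed(1234), noise_from_seed(1234)))  # True
print(torch.equal(noise_from_seed(1234), noise_from_seed(1235)))  # False
```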
You are 100% correct in highlighting this issue to the dev.
Though it’s not a legal document or a science paper.
Just a guide to explain seeds to newbies.
Omitting non-essential information, for the sake of making the concept clearer, can be good too.
The perchance dev is correct here, Allo;
the same seed will generate the exact same picture.
If you see variety, it will be due to factors outside the SD model. That stuff happens.
But it’s good that you fact check stuff.
Do you know where I can find documentation on the perchance API?
Specifically createPerchanceTree ?
I need to know which functions there are, and what inputs/outputs they take.
Thanks! I appreciate the support. Helps a lot to know where to start looking ( ; v ;)b!
New stuff
Paper: https://arxiv.org/abs/2303.03032
Takes only a few seconds to calculate.
Most similar suffix tokens: "vfx"
Most similar prefix tokens: "imperi-"
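For anyone curious, here is a sketch of the kind of lookup involved: cosine similarity over CLIP's token-embedding table. This is a reconstruction of the idea, not necessarily the exact script behind the numbers above:

```python
# Sketch: find the tokens closest to a query token in CLIP's token-embedding table.
# Reconstruction of the idea; not necessarily the exact script used for the numbers above.
import torch
from transformers import CLIPTextModel, CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
text_model = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")
table = text_model.get_input_embeddings().weight  # [vocab_size, dim]

query_id = tokenizer("vfx", add_special_tokens=False).input_ids[-1]
sims = torch.nn.functional.cosine_similarity(table[query_id].unsqueeze(0), table, dim=-1)
top_ids = sims.topk(11).indices.tolist()  # first hit is the query token itself
print(tokenizer.convert_ids_to_tokens(top_ids))
```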
I count casualty_rate = number_shot / (number_shot + number_subdued)
Which in this case is 22/64 = 34% casualty rate for civilians
and 98/131 = 75% casualty rate for police
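Or, as a quick sanity check of that arithmetic (the subdued counts are just the totals minus the shot counts):

```python
# Quick check of the casualty-rate arithmetic above.
def casualty_rate(number_shot, number_subdued):
    return number_shot / (number_shot + number_subdued)

print(casualty_rate(22, 64 - 22))   # bystanders: ~0.34
print(casualty_rate(98, 131 - 98))  # police: ~0.75
```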
So it’s 64 vs. 131 between work done by bystanders and work done by police?
And casualty rate is actually lower for bystanders doing the work (with their guns) than the police?
I can’t speculate.
If you feel up for the task, I’d suggest running prompts with Euler a at 20 steps for a given seed using that model and seeing if the results match images on the perchance site.
If they do, then we know the furry model = Pony Diffusion.
(Though IIRC the furry model on perchance existed before Pony Diffusion.)
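Something along these lines with diffusers would do it. The checkpoint path is a placeholder for whatever Pony Diffusion release you want to test against, and the prompt/seed need to match what you used on perchance:

```python
# Rough sketch: generate with Euler a, 20 steps, fixed seed, for comparison.
# "path/to/pony-diffusion-checkpoint" is a placeholder, not a real model id.
import torch
from diffusers import AutoPipelineForText2Image, EulerAncestralDiscreteScheduler

pipe = AutoPipelineForText2Image.from_pretrained(
    "path/to/pony-diffusion-checkpoint", torch_dtype=torch.float16
).to("cuda")
pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)  # "Euler a"

image = pipe(
    prompt="same prompt as used on perchance",
    num_inference_steps=20,
    generator=torch.Generator("cuda").manual_seed(1234),  # fixed seed
).images[0]
image.save("comparison.png")
```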
Aha. So what you wanted to say was that “Starlight” and/or “Glimmer” are trigger words for the furry model. Gotcha!
My solution has been to right-click and select “inspect element” to open the browser’s HTML window.
Then zoom out the generator as far as it goes, and scroll down so the entire image gallery (or at least part of it) is rendered within the browser.
Then Ctrl+C to copy the HTML, paste it into Notepad++, and use regular expressions to sort out the image prompts (and image source links) from the HTML code.
Not exactly a good fix, but it gets the job done at least.
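If anyone wants to skip the Notepad++ step, the same regex filtering can be done in a few lines of Python. The "src"/"alt" attribute names here are assumptions; inspect the real gallery HTML and adjust the pattern accordingly:

```python
# Sketch: pull image links (and whatever attribute holds the prompt) out of saved gallery HTML.
# The attribute names below are assumptions -- adjust the pattern to the actual HTML.
import re

with open("gallery.html", encoding="utf-8") as f:
    html = f.read()

pattern = re.compile(r'<img[^>]*\bsrc="([^"]+)"[^>]*\balt="([^"]*)"', re.IGNORECASE)
for src, alt in pattern.findall(html):
    print(src, "|", alt)
```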