Does this support text prompts?
#3 by YaTharThShaRma999 - opened
Does the Redux model support text prompts along with an image prompt, or only an image prompt? Thanks for your new LoRAs and ControlNets!
Same here. Please help.
yes
import torch
from diffusers import FluxPriorReduxPipeline, FluxPipeline
from transformers import CLIPTextModel, CLIPTokenizer, T5EncoderModel, T5TokenizerFast
from diffusers.utils import load_image

device = "cuda"
dtype = torch.bfloat16

# Load the FLUX.1-dev text encoders and tokenizers so the Redux prior can encode a prompt.
text_encoder = CLIPTextModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="text_encoder",
    torch_dtype=dtype,
)
text_encoder_2 = T5EncoderModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="text_encoder_2",
    torch_dtype=dtype,
)
tokenizer = CLIPTokenizer.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="tokenizer",
)
tokenizer_2 = T5TokenizerFast.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="tokenizer_2",
)

repo_redux = "black-forest-labs/FLUX.1-Redux-dev"
repo_base = "black-forest-labs/FLUX.1-dev"

# Redux prior pipeline, with the text encoders passed in explicitly.
pipe_prior_redux = FluxPriorReduxPipeline.from_pretrained(
    repo_redux,
    text_encoder=text_encoder,
    text_encoder_2=text_encoder_2,
    tokenizer=tokenizer,
    tokenizer_2=tokenizer_2,
    torch_dtype=dtype,
).to(device)

# Base FLUX.1-dev pipeline that generates the final image.
pipe = FluxPipeline.from_pretrained(
    repo_base,
    torch_dtype=dtype,
).to(device)

my_image = load_image("image.png")

# The prior takes the image together with a text prompt (put your prompt here).
pipe_prior_output = pipe_prior_redux(
    my_image,
    prompt="",
)

# Pass the prior's output on to the base pipeline.
images = pipe(
    guidance_scale=2.5,
    num_inference_steps=50,
    generator=torch.Generator("cpu").manual_seed(0),
    **pipe_prior_output,
).images
Were you able to get any difference from the prompt? I see the same results. It seems to be ignored.
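One way to check whether the prompt is really ignored, rather than judging by eye, is to generate twice with the same seed (once with `prompt=""` and once with a real prompt) and measure how much the pixels differ. Below is a minimal sketch of such a check. The helper name `mean_abs_pixel_diff` is made up for illustration and is not part of diffusers; it only assumes both images have the same size and can be converted with `np.asarray` (PIL images, like the ones the pipeline returns, can).

```python
import numpy as np


def mean_abs_pixel_diff(image_a, image_b):
    """Mean absolute per-channel difference between two same-sized images.

    Returns 0.0 when the images are identical. Hypothetical helper for
    illustration; accepts PIL images or NumPy arrays.
    """
    a = np.asarray(image_a, dtype=np.float32)
    b = np.asarray(image_b, dtype=np.float32)
    if a.shape != b.shape:
        raise ValueError(f"shape mismatch: {a.shape} vs {b.shape}")
    return float(np.abs(a - b).mean())


# Hypothetical usage with the pipelines from the snippet above
# (img_empty and img_text are images generated with the same seed,
# one with prompt="" and one with a non-empty prompt):
#
#   diff = mean_abs_pixel_diff(img_empty, img_text)
#   print(f"mean abs pixel difference: {diff:.3f}")
```

A value of exactly 0 would mean the two runs produced identical images, which would suggest the prompt had no effect on the output. A small non-zero value would mean the prompt does reach the model but is mostly outweighed by the image conditioning.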
Any news on this?
As per the README: "... the API endpoint allows users to modify an image given a textual description. The feature is supported in our latest model FLUX1.1 [pro] Ultra..."
So it may not be possible to use a text prompt along with an image using the publicly available dev/schnell + Redux repos.