Upload folder using huggingface_hub

Files changed (6)

README.md +8 -75
feature_extractor/preprocessor_config.json +3 -3
flux1-redux-dev.safetensors +2 -2
image_embedder/diffusion_pytorch_model.safetensors +2 -2
image_encoder/config.json +4 -4
image_encoder/model.safetensors +2 -2

README.md CHANGED

@@ -4,89 +4,22 @@ language:
 license: other
 license_name: flux-1-dev-non-commercial-license
 license_link: LICENSE.md
-extra_gated_prompt: By clicking "Agree", you agree to the [FluxDev Non-Commercial License Agreement](https://huggingface.co/black-forest-labs/FLUX.1-Depth-dev/blob/main/LICENSE.md)
-  and acknowledge the [Acceptable Use Policy](https://huggingface.co/black-forest-labs/FLUX.1-Depth-dev/blob/main/POLICY.md).
 tags:
 - image-generation
 - flux
-- diffusion-single-file
 ---
 ![image/png](https://huggingface.co/black-forest-labs/FLUX.1-Redux-dev/resolve/main/redux.png)
-FLUX.1 Redux [dev] is an adapter for all FLUX.1 base models for image variation generation.
-Given an input image, FLUX.1 Redux can reproduce the image with slight variation, allowing to refine a given image.
-It naturally integrates into more complex workflows unlocking image restyling.
-Restyling via text is also available through our API by providing an image plus a language prompt.
-For more information, please read our [blog post](https://blackforestlabs.ai/flux-1-tools/).
-# Usage
-We provide a reference implementation of `FLUX.1 Redux [dev]`, as well as sampling code, in a dedicated [github repository](https://github.com/black-forest-labs/flux).
-## API Endpoints
-`FLUX.1 Redux [pro]` is available in our API [bfl.ml](https://docs.bfl.ml/). In addition to the `[dev]` adapter, the API endpoint allows users to modify an image given a textual description.
-The feature is supported in our latest model FLUX1.1 [pro] Ultra, allowing for combining input images and text prompts to create high-quality 4-megapixel outputs with flexible aspect ratios.
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/64510d6304397681bcf9725b/P123igomKjAkmitOzot8T.png)
-## Diffusers
-To use `FLUX.1 Redux [pro]` with the 🧨 diffusers python library, first install or upgrade diffusers
-```shell
-pip install -U diffusers
-```
-Then you can use `FluxPriorReduxPipeline` along with `FluxPipeline` to generate images from images.
-```python
-import torch
-from diffusers import FluxPriorReduxPipeline, FluxPipeline
-from diffusers.utils import load_image
-pipe_prior_redux = FluxPriorReduxPipeline.from_pretrained("black-forest-labs/FLUX.1-Redux-dev", torch_dtype=torch.bfloat16).to("cuda")
-pipe = FluxPipeline.from_pretrained(
-    "black-forest-labs/FLUX.1-dev" ,
-    text_encoder=None,
-    text_encoder_2=None,
-    torch_dtype=torch.bfloat16
-).to("cuda")
-image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/robot.png")
-pipe_prior_output = pipe_prior_redux(image)
-images = pipe(
-    guidance_scale=2.5,
-    num_inference_steps=50,
-    generator=torch.Generator("cpu").manual_seed(0),
-    **pipe_prior_output,
-).images
-images[0].save("flux-dev-redux.png")
-```
-To learn more check out the [diffusers](https://huggingface.co/docs/diffusers/main/en/api/pipelines/flux) documentation
 ---
-# Limitations
-- This model is not intended or able to provide factual information.
-- As a statistical model this checkpoint might amplify existing societal biases.
-- The model may fail to generate output that matches the prompts.
-- Outputs are heavily influenced by the input image.
-# Out-of-Scope Use
-The model and its derivatives may not be used
-- In any way that violates any applicable national, federal, state, local or international law or regulation.
-- For the purpose of exploiting, harming or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
-- To generate or disseminate verifiably false information and/or content with the purpose of harming others.
-- To generate or disseminate personal identifiable information that can be used to harm an individual.
-- To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
-- To create non-consensual nudity or illegal pornographic content.
-- For fully automated decision making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
-- Generating or facilitating large-scale disinformation campaigns.
-# License
-This model falls under the [`FLUX.1 [dev]` Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-Depth-dev/blob/main/LICENSE.md).

 license: other
 license_name: flux-1-dev-non-commercial-license
 license_link: LICENSE.md
 tags:
 - image-generation
 - flux
+- siglip2
 ---
 ![image/png](https://huggingface.co/black-forest-labs/FLUX.1-Redux-dev/resolve/main/redux.png)
+# Flex.1-alpha-Redux [dev] — Enhanced with SigLIP-2
+FLUX.1 Redux [dev] is an adapter for all FLUX.1 base models for image variation generation.
+Given an input image, FLUX.1 Redux can reproduce the image with slight variation, allowing for refined visual generation. It integrates naturally into more complex workflows, enabling image restyling and editing via text.
+This version of FLUX.1 Redux has been updated with modules from [`google/siglip2-so400m-patch16-512`](https://huggingface.co/google/siglip2-so400m-patch16-512), replacing the:
+- `image_encoder`
+- `feature_extractor`
+- `image_embedder`
 ---

feature_extractor/preprocessor_config.json CHANGED

@@ -15,10 +15,10 @@
     0.5
   ],
   "processor_class": "SiglipProcessor",
-  "resample": 3,
   "rescale_factor": 0.00392156862745098,
   "size": {
-    "height": 384,
-    "width": 384
   }
 }

     0.5
   ],
   "processor_class": "SiglipProcessor",
+  "resample": 2,
   "rescale_factor": 0.00392156862745098,
   "size": {
+    "height": 512,
+    "width": 512
   }
 }
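
Reviewer note on the hunk above: the numeric `resample` field is easy to misread. The sketch below assumes the value follows Pillow's `Image.Resampling` integer codes, which is how `transformers` image processors conventionally interpret it. The code table is copied in by hand so the snippet runs without Pillow, the `old` and `new` dicts contain only the fields this hunk touches, and `describe` is a hypothetical helper.

```python
# Pillow resampling filter codes (PIL.Image.Resampling), reproduced here so
# the sketch has no third-party dependency.
PIL_RESAMPLING = {
    0: "NEAREST",
    1: "LANCZOS",
    2: "BILINEAR",
    3: "BICUBIC",
    4: "BOX",
    5: "HAMMING",
}

# Only the fields changed in preprocessor_config.json, taken from the diff.
old = {"resample": 3, "size": {"height": 384, "width": 384}}
new = {"resample": 2, "size": {"height": 512, "width": 512}}


def describe(cfg):
    """Summarise target size and resampling filter of a preprocessor config."""
    size = cfg["size"]
    return f'{size["width"]}x{size["height"]} via {PIL_RESAMPLING[cfg["resample"]]}'


summary = {"old": describe(old), "new": describe(new)}
print(summary)
```

Under that assumption, the commit switches preprocessing from 384x384 bicubic to 512x512 bilinear resizing.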

flux1-redux-dev.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a1b3bdcb4bdc58ce04874b9ca776d61fc3e914bb6beab41efb63e4e2694dca45
-size 129063232

 version https://git-lfs.github.com/spec/v1
+oid sha256:8a422b7a55b1fb7232af85bd4d6c5a26f763879734e856e8661815016e60f161
+size 129008400

image_embedder/diffusion_pytorch_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:02ace6d3b9dc6fa1ab77e6863151430a3ff128f0d0e378021ab9bcb7f2ed18f0
-size 129008000

 version https://git-lfs.github.com/spec/v1
+oid sha256:8a422b7a55b1fb7232af85bd4d6c5a26f763879734e856e8661815016e60f161
+size 129008400
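
Reviewer note: after this commit, `flux1-redux-dev.safetensors` and `image_embedder/diffusion_pytorch_model.safetensors` carry the same `oid` and `size`, so the two paths should resolve to byte-identical content. The following is a minimal sketch of how a downloaded file could be checked against such a Git LFS pointer. The helper names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical, and the pointer text is the one shown in the diff.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so large checkpoints are never loaded fully into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:8a422b7a55b1fb7232af85bd4d6c5a26f763879734e856e8661815016e60f161
size 129008400
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `file_matches_pointer("flux1-redux-dev.safetensors", pointer)` on a local copy would then confirm that the download matches the committed pointer.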

image_encoder/config.json CHANGED

@@ -5,14 +5,14 @@
   "attention_dropout": 0.0,
   "hidden_act": "gelu_pytorch_tanh",
   "hidden_size": 1152,
-  "image_size": 384,
   "intermediate_size": 4304,
   "layer_norm_eps": 1e-06,
   "model_type": "siglip_vision_model",
   "num_attention_heads": 16,
   "num_channels": 3,
   "num_hidden_layers": 27,
-  "patch_size": 14,
   "torch_dtype": "bfloat16",
-  "transformers_version": "4.45.2"
-}

   "attention_dropout": 0.0,
   "hidden_act": "gelu_pytorch_tanh",
   "hidden_size": 1152,
+  "image_size": 512,
   "intermediate_size": 4304,
   "layer_norm_eps": 1e-06,
   "model_type": "siglip_vision_model",
   "num_attention_heads": 16,
   "num_channels": 3,
   "num_hidden_layers": 27,
+  "patch_size": 16,
   "torch_dtype": "bfloat16",
+  "transformers_version": "4.51.3"
+}
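
Reviewer note: changing `image_size` and `patch_size` together changes how many patch tokens the vision encoder emits per image. A rough sketch of the arithmetic, assuming the usual ViT-style patch embedding (a non-overlapping convolution with stride equal to `patch_size`, so any partial patch at the border is dropped). `num_patches` is a hypothetical helper.

```python
def num_patches(image_size, patch_size):
    """Number of patch tokens for a square image with non-overlapping patches."""
    per_side = image_size // patch_size  # partial border patches are dropped
    return per_side * per_side


old_tokens = num_patches(384, 14)  # 27 x 27 grid
new_tokens = num_patches(512, 16)  # 32 x 32 grid
print(old_tokens, new_tokens)  # 729 1024
```

Since `hidden_size` stays at 1152, the per-token width is unchanged, but anything downstream that assumed 729 image tokens would now see 1024.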

image_encoder/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d769e3a32a6a9bac72d4d93b989e44491f71b50f02bfa14cd9187758d4a68ff1
-size 856506120

 version https://git-lfs.github.com/spec/v1
+oid sha256:dc315325a08b9ada2eb1f2cf9fb1b6874defacf1f3b0ae8dc21bf021d6437db1
+size 857600520
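
Reviewer note: the size change of `image_encoder/model.safetensors` can be sanity-checked against the `image_encoder/config.json` diff. The sketch below assumes that only two tensors change shape (the patch-embedding convolution weight and the learned position embedding), that weights are stored as `bfloat16` at 2 bytes per parameter, and that the safetensors header length is unchanged. None of this is stated in the commit; the helper names are hypothetical.

```python
BYTES_PER_PARAM = 2        # bfloat16, per "torch_dtype" in config.json
HIDDEN, CHANNELS = 1152, 3  # "hidden_size" and "num_channels" in config.json


def patch_embed_params(patch_size):
    """Weights of a conv patch embedding: channels x patch x patch x hidden."""
    return CHANNELS * patch_size * patch_size * HIDDEN


def pos_embed_params(image_size, patch_size):
    """One learned position vector of width HIDDEN per patch token."""
    return (image_size // patch_size) ** 2 * HIDDEN


old = patch_embed_params(14) + pos_embed_params(384, 14)
new = patch_embed_params(16) + pos_embed_params(512, 16)

predicted_delta = (new - old) * BYTES_PER_PARAM
observed_delta = 857600520 - 856506120  # sizes from the LFS pointers above
print(predicted_delta, observed_delta)  # 1094400 1094400
```

The predicted and observed deltas agree, which is consistent with (though it does not prove) the encoder keeping its architecture and only changing patch size and input resolution.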