Spaces:

mknolan
/

internvl25-image-analyzer-clean

Paused

App Files Files Community

mknolan commited on Mar 19

Commit

8e6ddeb

verified ·

1 Parent(s): 565626f

Copy from mknolan/internvl25-image-analyzer

Browse files

Files changed (1) hide show

README.md +54 -6

README.md CHANGED Viewed

@@ -1,10 +1,58 @@
 ---
-title: Internvl25 Image Analyzer Clean
-emoji: 👀
-colorFrom: red
-colorTo: yellow
-sdk: docker
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: InternVL2.5 Image Analyzer
+emoji: 🖼️
+colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: 3.50.0
+app_file: app.py
 pinned: false
 ---
+# InternVL2.5 Image Analyzer
+This Hugging Face Space demonstrates the capabilities of the [InternVL2.5 model](https://huggingface.co/OpenGVLab/InternVL2_5-8B), a powerful multimodal model that can analyze images and respond to questions about them.
+## Features
+- Upload your own images for analysis
+- Choose from predefined prompts or create your own
+- Detailed image understanding and description
+- Text recognition in images
+- Visual reasoning capabilities
+## Model Details
+This space uses the InternVL2.5-8B model, which is a multimodal large language model (MLLM) with approximately 8.1 billion parameters. The model was developed by OpenGVLab and demonstrates strong capabilities in various visual understanding tasks.
+### Architecture
+InternVL2.5 combines a vision encoder (based on the InternViT architecture) with a language model, allowing it to process both visual and textual information.
+## Example Prompts
+Here are some prompts you can try:
+1. Describe this image in detail.
+2. What can you tell me about this image?
+3. Is there any text in this image? If so, can you read it?
+4. What is the main subject of this image?
+5. What emotions or feelings does this image convey?
+6. Describe the composition and visual elements of this image.
+7. Summarize what you see in this image in one paragraph.
+## Usage
+1. Upload an image using the file uploader
+2. Select a prompt from the dropdown or write your own
+3. Click "Submit" to get the analysis
+## Credits
+This application uses the InternVL2.5 model by OpenGVLab. For more information about the model, check out:
+- [OpenGVLab/InternVL Repository](https://github.com/OpenGVLab/InternVL)
+- [InternVL Documentation](https://internvl.readthedocs.io/en/latest/)
+## License
+The InternVL2.5 model is licensed under the MIT License.