This is a Streamlit app that takes an image as input and generates a descriptive caption for it using the BLIP vision-language model from Hugging Face.
- Upload any .png, .jpg, or .jpeg image
- See the image and a generated caption in your browser
- Built with Streamlit, BLIP, PyTorch, and Transformers

Install the dependencies with `pip install -r requirements.txt`.
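
The upload-and-caption flow described above could look roughly like the sketch below. This is not necessarily how this app is implemented: the checkpoint name `Salesforce/blip-image-captioning-base`, the helper names `generate_caption`, `load_model`, and `main`, and the `max_new_tokens` default are all assumptions made for illustration.

```python
def generate_caption(image, processor, model, max_new_tokens=30):
    """Return a caption for a PIL image using a BLIP-style processor and model."""
    # The processor converts the image into model-ready tensors.
    inputs = processor(images=image, return_tensors="pt")
    # The model generates caption token ids, which the processor decodes to text.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True).strip()


def main():
    # Heavy imports stay inside main() so generate_caption can be imported alone.
    import streamlit as st
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    # Assumed checkpoint; the project may use a different BLIP variant.
    checkpoint = "Salesforce/blip-image-captioning-base"

    @st.cache_resource  # load the model once instead of on every rerun
    def load_model():
        processor = BlipProcessor.from_pretrained(checkpoint)
        model = BlipForConditionalGeneration.from_pretrained(checkpoint)
        return processor, model

    st.title("Image captioning with BLIP")
    uploaded = st.file_uploader("Upload an image", type=["png", "jpg", "jpeg"])
    if uploaded is not None:
        image = Image.open(uploaded).convert("RGB")
        st.image(image, caption="Uploaded image")
        processor, model = load_model()
        st.write(generate_caption(image, processor, model))
```

To try it, save the sketch to a file (for example a hypothetical `app.py`), add a call to `main()` at the bottom, and start it with `streamlit run app.py`. Passing the processor and model into `generate_caption` as arguments keeps the captioning step separate from the UI code.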