🌌 Explore image captioning with the Gemini Image Captioning Demo! Powered by Streamlit 🐍🔧 and Google's Gemini Pro Vision API 🌟, it generates captions for your uploaded images. 📸💬

sathwik-3721/Image-Captioning-Project

🌌 Gemini Image Captioning Demo

Overview

This is a basic image captioning application built on the 🚀 Gemini Pro Vision API. Users upload an image, and the Gemini Pro model generates a caption for it. 📸🔮

Getting Started

Prerequisites

  • Python 3.6 or later 🐍
  • pip (Python package installer) 📦
  • An API key for the 🚀 Gemini Pro Vision API (obtained from Google) 🔑

Installation

  1. Clone the repository:

    git clone https://github.com/sathwik-3721/gemini-image-captioning-demo.git
    cd gemini-image-captioning-demo
    
  2. Install the required dependencies:

    pip install -r requirements.txt
    
  3. Create a .env file in the root directory with your Google API key:

    GOOGLE_API_KEY=your_google_api_key_here
    
  4. Run the Streamlit app:

    streamlit run app.py
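
Step 3 stores the key in a .env file, so the app has to read that file at startup. The sketch below shows one way this could work using only the standard library. The helper names `load_env_file` and `get_api_key` are hypothetical, and the actual app.py may well use a package such as python-dotenv instead; treat this as an illustration of the idea, not as the project's code.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse KEY=VALUE lines from a .env-style file into os.environ.

    Hypothetical stand-in for a dotenv loader. Variables that are already
    set in the environment are left untouched.
    """
    env_path = Path(path)
    if not env_path.is_file():
        return {}
    loaded = {}
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("'\"")  # drop optional surrounding quotes
        loaded[key] = value
        os.environ.setdefault(key, value)
    return loaded


def get_api_key(name="GOOGLE_API_KEY"):
    """Return the API key, or raise a clear error if it is missing."""
    key = os.environ.get(name, "").strip()
    if not key:
        raise RuntimeError(f"{name} is not set; add it to your .env file.")
    return key
```

Calling `load_env_file()` followed by `get_api_key()` before the first API request makes a missing or empty key fail early with a readable message.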
    
    

Usage

  1. Open the Streamlit app in your web browser. 🌐
  2. Enter a caption or description in the input text box (optional). ✍️
  3. Upload an image (supported formats: jpg, jpeg, png). 🖼️
  4. Click the "Generate Caption about image" button. 🚀
  5. View the generated caption below the image. 📝
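
The steps above could be wired together roughly as follows. This is a minimal sketch, not the repository's app.py. It assumes the `streamlit`, `google-generativeai`, and `Pillow` packages are installed, that `GOOGLE_API_KEY` is already present in the environment, and that the model is named `gemini-pro-vision`. The widget labels (apart from the button text) and the helper `build_request` are hypothetical.

```python
import os


def build_request(user_text, image):
    """Combine the optional text with the image into a content list."""
    text = (user_text or "").strip()
    return [text, image] if text else [image]


def main():
    # Third-party imports live here so build_request() can be used
    # without Streamlit or the Gemini client installed.
    import streamlit as st
    import google.generativeai as genai
    from PIL import Image

    # Assumes the key was exported or loaded from .env beforehand.
    genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
    model = genai.GenerativeModel("gemini-pro-vision")  # assumed model name

    st.header("Gemini Image Captioning Demo")
    user_text = st.text_input("Input prompt (optional):")
    uploaded = st.file_uploader("Choose an image", type=["jpg", "jpeg", "png"])

    image = None
    if uploaded is not None:
        image = Image.open(uploaded)
        st.image(image, caption="Uploaded image")

    if st.button("Generate Caption about image"):
        if image is None:
            st.warning("Please upload an image first.")
        else:
            # Send the optional text plus the image to the model.
            response = model.generate_content(build_request(user_text, image))
            st.subheader("Generated caption")
            st.write(response.text)
```

To try it, add a `main()` call at the bottom of the file and launch it with `streamlit run app.py`. Keeping the request assembly in a small pure function makes the optional-text behaviour easy to check without calling the API.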

Built With

  • Streamlit - Web framework for creating interactive web applications with Python. 🐍🔧
  • Google Gemini Pro API Vision - Gemini Pro Vision API for generating content from images. 🌌

License

This project is licensed under the MIT License - see the LICENSE file for details. 📜

Acknowledgments

  • Special thanks to Google for providing the Gemini Pro API.

Made by Sathwik Ramisetty with ❤️ | © 2023 All rights reserved.
