https://github.com/gitcoder052023/build-with-gemini
This Gemini Repository uses Google's Gemini models to perform various tasks, and shows usage, and Project ideas for students and researchers, and this repo gives an ideas to use Google pre-trained model Gemini to make things easier.
https://github.com/gitcoder052023/build-with-gemini
ai artificial-intelligence gemini gemini-pro-vision generative-ai generativeai ml python
Last synced: about 1 month ago
JSON representation
This Gemini Repository uses Google's Gemini models to perform various tasks, and shows usage, and Project ideas for students and researchers, and this repo gives an ideas to use Google pre-trained model Gemini to make things easier.
- Host: GitHub
- URL: https://github.com/gitcoder052023/build-with-gemini
- Owner: GitCoder052023
- Created: 2023-12-28T07:58:18.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-03T14:32:43.000Z (over 1 year ago)
- Last Synced: 2025-04-30T13:44:36.137Z (about 1 month ago)
- Topics: ai, artificial-intelligence, gemini, gemini-pro-vision, generative-ai, generativeai, ml, python
- Language: Python
- Homepage:
- Size: 64.9 MB
- Stars: 6
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Introduction
**Build-with-Gemini** is a comprehensive set of Python Projects and ideas which uses Google's latest and most capable models **Gemini-pro** and **Gemini-pro vision** to perform various tasks and projects. These tasks include text-to-speech conversion, interactive chat functionalities, and sophisticated image and video processing, these scripts empower the user to seamlessly perform AI-driven tasks such as dynamic chatting, content generation, image recognition, and video analysis.
# Key Features:
- Text-to-Speech Conversion: Utilizing advanced capabilities for converting text into natural-sounding speech.- Interactive Chat Interaction: Engaging chat functionalities that enhance user experience and interaction.
- Image and Video Processing: Powerful scripts for image recognition and video analysis, tapping into the potential of Gemini-pro-vision.
- Content Generation: Harnessing the prowess of Google's Gemini models to generate diverse and dynamic content.
# Educational Insights:
Gemini not only serves as a practical toolkit for AI enthusiasts but also offers valuable insights and project ideas for students. It provides a platform for hands-on learning and experimentation with Google's Gemini models, inspiring students to delve into exciting AI projects and applications.Explore the limitless possibilities of AI with Gemini and embark on a journey of innovation and discovery in the realm of artificial intelligence.
# Setup and Installation of Gemini
# Setting up API KEY
To use the API, we have to first get an API key that you can can from here: https://ai.google.dev/tutorials/setup
After that click on “Get an API key” button and then click on “Create API key in new project”.
Copy the API key and set it as an environment variable. We are using Deepnote and it is quite easy for us to set the key with the name you want. Just go to the integration, scroll down and select environment variables.
# Dependencies of Gemini
To use these scripts, you will need to install the following dependencies:pathlib
textwrap
google.generativeai
IPython.display
PIL.Image
cv2
pyttsx3Now, to install the required modules, you can use this following pip command in your terminal or command prompt:
```bash
pip install google-generativeai pyttsx3 opencv-python
```You will also need to configure the Google Generative AI models with your API key. And you will will need to install "Python 3.10" to use Gemini.
# follow these steps to clone Gemini:
To clone gemini you can use this pip command:
```bash
git clone https://github.com/GitCoder052023/Build-with-Gemini # clone
cd Build-with-Gemini
pip install -r requirements.txt # install
```