gpt-4-vision

Here are 79 public repositories matching this topic...

lancedb / vectordb-recipes

Resource, examples & tutorials for multimodal AI, RAG and agents using vector search and LLMs

machine-learning ai deep-learning embeddings openai gpt agents fine-tuning multimodal rag vector-database llms langchain llama-index lancedb gpt-4-vision multimodal-ai

Updated Apr 13, 2026
Jupyter Notebook

TypingMind / typingmind

Star

The most advanced Web UI for AI chat

gemini webui claude gpt-4 chatgpt chatgpt-ui typingmind claude2 gpt-4-turbo gpt-4-vision gemini-pro

Updated Apr 23, 2026
HTML

Skythinker616 / gpt-assistant-android

Star

【新增智能体模式】安卓端全场景GPT助手，可用音量键唤起并进行语音交流，支持联网、拍照、模板、附件解析、智能体模式等 | GPT assistant for Android, activated via volume keys for voice interaction, supporting features such as networking, taking photos, templates, parsing PDF and Office documents, and agent mode.

android agent markdown accessibility assistant gpt vlm llm chatgpt free-gpt gpt-4-vision

Updated Apr 19, 2026
Java

SkalskiP / sports

Star

Cool experiments at the intersection of Computer Vision and Sports ⚽🏃

tutorial deep-neural-networks computer-vision deep-learning pytorch object-detection sports-analytics yolov5 gpt-4 yolov7 prompt-engineering gpt-4-vision

Updated Dec 12, 2023
Jupyter Notebook

tbckr / sgpt

Star

SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.

go shell bash cli gemini openai gemini-api gpt-3 gpt-4 anthropic anthropic-claude openrouter gpt-4-vision-preview gpt-4-vision gemini-pro gpt-4o o1-mini o1-preview openrouter-api

Updated Apr 23, 2026
Go

WisconsinAIVision / ViP-LLaVA

Star

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

chatbot llama multi-modal clip vision-language gpt-4 foundation-models visual-prompting llava llama2 cvpr2024 gpt-4-vision

Updated Jul 17, 2024
Python

vdutts7 / gpt4V-scraper

Star

AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.

web-scraping browser-automation ai-agents puppeteer gpt-4-vision

Updated Mar 1, 2026
JavaScript

developersdigest / ai-devices

Sponsor

Star

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

tts openai whisper groq llm langchain llava function-calling langsmith gpt-4-vision serper llama3

Updated Jul 22, 2024
TypeScript

davidmigloz / pixels2flutter

Sponsor

Star

Convert a screenshot to a working Flutter app.

openai flutter llms gpt-4-vision

Updated Apr 1, 2025
Dart

ktutak1337 / Stellar-Chat

Star

A versatile multi-modal chat application that enables users to develop custom agents, create images, leverage visual recognition, and engage in voice interactions. It integrates seamlessly with local LLMs and commercial models like OpenAI, Gemini, Perplexity, and Claude, and allows to converse with uploaded documents and websites.

Updated Sep 4, 2024
C#

animalnots / BetterChatGPT-PLUS

Star

Maintained version of bettergpt. An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux). https://discord.gg/2CKfAbAJrH

ai chatbot prompt openai free prompt-toolkit gpt gpt-3 gpt-4 prompt-engineering chatgpt gpt-35-turbo better-chat-gpt llm-framework gpt-4-vision gpt-4o betterchatgpt

Updated Aug 10, 2025
TypeScript

nateraw / openai-vision-api-for-videos

Star

Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦

python machine-learning openai colab-notebook gpt-4 chatgpt gpt-4-vision

Updated Nov 7, 2023
Jupyter Notebook

signebedi / gptty

Star

ChatGPT wrapper in your TTY

python shell package query chatroom chatbot tty click openai gpt-3 openai-api gpt-4 chatgpt chatgpt-api gpt-35-turbo gpt-4-turbo gpt-4-vision

Updated Feb 29, 2024
Python

GianfrancoCorrea / gpt-4-vision-chat

Star

GPT 4 Turbo Vision with Chainlit

gpt-4 chainlit gpt-4-turbo gpt-4-vision

Updated Nov 27, 2023
Python

Badim41 / network_tools

Star

API | GPT-5, GML-4.5, VEO-3, Kling, gpt-4o, Claude 4 opus, command a, Recraft v3, Dalle-3, Stable Diffusion, Flux, Kandinsky, Suno V4.5, Hailuo, TTS

Updated Apr 2, 2026
Jupyter Notebook

supershaneski / chatgpt-with-image-sample

Star

This sample project integrates OpenAI's GPT-4 Vision, with advanced image recognition capabilities, and DALL·E 3, the state-of-the-art image generation model, with the Chat completions API. This powerful combination allows for simultaneous image creation and analysis.

reactjs nextjs chatbot openai image-analysis openai-api chatgpt openai-chatgpt function-calling chatgpt-image dall-e-3 gpt-4-vision-preview gpt-4-vision

Updated Nov 22, 2023
JavaScript

neka-nat / mylangrobot

Sponsor

Star

Language instructions to mycobot using GPT-4V

whisper mycobot chatgpt segment-anything gpt4v gpt-4-vision-preview gpt-4-vision

Updated Dec 11, 2023
Python

jeremy-collins / gpt4v-screenshot-analyzer

Star

This tool offers an interactive way to analyze and understand your screenshots using OpenAI's GPT-4 Vision API. Capture any part of your screen and engage in a dialogue with ChatGPT to uncover detailed insights, ask follow-up questions, and explore visual data in a user-friendly format.

screenshot ai computer-vision chatbot gpt-4 chatgpt gpt-4-vision

Updated Aug 8, 2024
Python

philfung / awesome-computer-use

Star

Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.

computer-vision vision rpa tool-use llm anthropic anthropic-claude gpt-4-vision rpa-robotic-process-automation gui-agents computer-use

Updated Mar 18, 2026

LazaUK / AOAI-GPT4Vision-Streamlit-SDKv1

Star

Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.

ai azure openai gpt out-of-stock streamlit gpt-4-vision

Updated Jan 3, 2024
Python

Improve this page

Add a description, image, and links to the gpt-4-vision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gpt-4-vision topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gpt-4-vision

Here are 79 public repositories matching this topic...

lancedb / vectordb-recipes

TypingMind / typingmind

Skythinker616 / gpt-assistant-android

SkalskiP / sports

tbckr / sgpt

WisconsinAIVision / ViP-LLaVA

vdutts7 / gpt4V-scraper

developersdigest / ai-devices

davidmigloz / pixels2flutter

ktutak1337 / Stellar-Chat

animalnots / BetterChatGPT-PLUS

nateraw / openai-vision-api-for-videos

signebedi / gptty

GianfrancoCorrea / gpt-4-vision-chat

Badim41 / network_tools

supershaneski / chatgpt-with-image-sample

neka-nat / mylangrobot

jeremy-collins / gpt4v-screenshot-analyzer

philfung / awesome-computer-use

LazaUK / AOAI-GPT4Vision-Streamlit-SDKv1

Improve this page

Add this topic to your repo