Video Analytics with Azure AI with a ChatGPT-like experience

A quick prototype of a video analytics solution that analyses the content of a video through a ChatGPT-like experience, using Azure Computer Vision for dense captioning and OCR, Azure Speech Services for speech to text, Azure OpenAI and LangChain.

Process

Extracting frames from the video file
Generating dense captions from the frames using Azure Computer Vision 4 (Florence)
Running OCR on the frames using Azure Computer Vision 4 (Florence)
Extracting the audio track from the video and transcribing it with Azure Speech Services
Using Azure OpenAI and LangChain
Storing the results in a FAISS index
Creating and using a bot for a ChatGPT-like experience
An example web app using Gradio
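
The first steps can be illustrated with a minimal, standard-library-only sketch. It is not the notebook's code: the sampling interval, the `FrameResult` structure and the function names `frame_indices` and `build_documents` are all assumptions made for illustration. It shows one plausible way to pick which frames to send to the vision service and to merge the per-frame captions, OCR text and the speech transcript into timestamped text chunks that could later be embedded and stored in FAISS.

```python
from dataclasses import dataclass


def frame_indices(total_frames: int, fps: float, every_seconds: float) -> list[int]:
    """Return the indices of the frames to keep when sampling
    one frame every `every_seconds` seconds (hypothetical helper)."""
    if total_frames <= 0 or fps <= 0 or every_seconds <= 0:
        raise ValueError("total_frames, fps and every_seconds must be positive")
    # Never use a step below 1, otherwise range() would fail.
    step = max(1, round(fps * every_seconds))
    return list(range(0, total_frames, step))


@dataclass
class FrameResult:
    """Assumed container for what the vision service returned for one frame."""
    index: int            # frame number in the video
    captions: list[str]   # dense captions for this frame
    ocr_text: str         # text read from this frame (may be empty)


def build_documents(frames: list[FrameResult], fps: float, transcript: str) -> list[str]:
    """Turn per-frame results and the transcript into timestamped text chunks."""
    docs = []
    for frame in frames:
        seconds = frame.index / fps
        parts = ["Captions: " + "; ".join(frame.captions)]
        if frame.ocr_text:
            parts.append("Text on screen: " + frame.ocr_text)
        docs.append(f"[{seconds:.1f}s] " + " | ".join(parts))
    if transcript:
        docs.append("Transcript: " + transcript)
    return docs
```

In a full pipeline, the frames selected by `frame_indices` would be read from the video file and sent to the captioning and OCR services, and the strings returned by `build_documents` would be the texts passed to the embedding model before indexing.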

Documentation

Some examples

19-Apr-2023 Serge Retkowsky | serge.retkowsky@microsoft.com | https://www.linkedin.com/in/serger/

