computer-vision

Here is a quick video overview of a computer vision task I have been working on. It includes object detection, image segmentation, and monocular depth estimation.

The idea came from a lecturer's assignment: conceptualise and research an application that combines a language model with a computer vision model. After a little reading, I was shocked to learn that roughly 300 million people have moderate to severe vision impairment and 36 million are completely blind. That led me to ask: which vision models are available for building situational understanding?

Video Output Link

Depth Estimation with "Intel/dpt-hybrid-midas"
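
As a rough illustration of the depth step, the sketch below loads the model through the Hugging Face `transformers` depth-estimation pipeline and then summarises which side of the frame looks nearest. It is not taken from the notebook: `estimate_depth` and `closest_side` are hypothetical helper names, and the assumption that larger values mean nearer surfaces (relative inverse depth, as MiDaS-style models usually report) should be checked against the model card.

```python
MODEL_ID = "Intel/dpt-hybrid-midas"  # model named in this README


def estimate_depth(image_path):
    """Run the depth model on one image and return a nested list of relative depth values."""
    # Imported lazily so closest_side() works without these packages installed.
    from PIL import Image
    from transformers import pipeline

    depth_estimator = pipeline("depth-estimation", model=MODEL_ID)
    result = depth_estimator(Image.open(image_path).convert("RGB"))
    # The pipeline output holds "predicted_depth" (tensor) and "depth" (PIL image).
    return result["predicted_depth"].squeeze().tolist()


def closest_side(depth_map, labels=("left", "centre", "right")):
    """Split the map into vertical bands and return the band with the highest mean value.

    Assumption: higher values mean closer surfaces.
    """
    width = len(depth_map[0])
    n = len(labels)
    totals = [0.0] * n
    counts = [0] * n
    for row in depth_map:
        for x, value in enumerate(row):
            band = min(x * n // width, n - 1)
            totals[band] += value
            counts[band] += 1
    means = [t / c if c else float("-inf") for t, c in zip(totals, counts)]
    return labels[means.index(max(means))]
```

A typical call would be `closest_side(estimate_depth("frame.jpg"))`, where `frame.jpg` is a placeholder path. Keeping the summary step separate from the model call makes it easy to try on small hand-written maps first.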

Object Detection with Ultralytics YOLOv8-Nano
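
The detection step could look something like the following sketch, which uses the `ultralytics` package with the pretrained nano weights (`yolov8n.pt`). The helpers `detect_objects` and `summarise`, the 0.25 confidence threshold, and the output format are illustrative assumptions, not the notebook's actual code.

```python
from collections import Counter


def detect_objects(image_path, min_confidence=0.25):
    """Return a list of detections (label, confidence, box) for one image."""
    # Imported lazily so summarise() works without ultralytics installed.
    from ultralytics import YOLO

    model = YOLO("yolov8n.pt")  # nano weights, downloaded on first use
    result = model(image_path)[0]  # one Results object per input image
    detections = []
    for box in result.boxes:
        confidence = float(box.conf[0])
        if confidence < min_confidence:
            continue
        detections.append({
            "label": result.names[int(box.cls[0])],
            "confidence": confidence,
            "box": box.xyxy[0].tolist(),  # [x1, y1, x2, y2] in pixels
        })
    return detections


def summarise(detections):
    """Turn detections into a short text summary, most frequent label first."""
    counts = Counter(d["label"] for d in detections)
    if not counts:
        return "nothing detected"
    return ", ".join(f"{label} x{n}" for label, n in counts.most_common())
```

A summary string of this kind is one plausible way to hand detections to a language model, which is the pairing the project set out to explore.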

Image Segmentation with "nvidia/segformer-b0-finetuned-ade-512-512"
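
For the segmentation step, a minimal sketch with the `transformers` image-segmentation pipeline is shown below. It measures how much of the frame each class covers and keeps only the larger regions. The helper names `segment_coverage` and `dominant_classes` and the 10% threshold are assumptions made for illustration.

```python
SEG_MODEL_ID = "nvidia/segformer-b0-finetuned-ade-512-512"  # model named in this README


def segment_coverage(image_path):
    """Return {class label: fraction of image pixels covered} for one image."""
    # Imported lazily so dominant_classes() works without these packages installed.
    import numpy as np
    from PIL import Image
    from transformers import pipeline

    segmenter = pipeline("image-segmentation", model=SEG_MODEL_ID)
    segments = segmenter(Image.open(image_path).convert("RGB"))
    # Each segment is a dict with a "label" and a "mask" (PIL image, non-zero inside the region).
    return {
        seg["label"]: float((np.array(seg["mask"]) > 0).mean())
        for seg in segments
    }


def dominant_classes(coverage, min_fraction=0.1):
    """List the labels covering at least min_fraction of the image, largest first."""
    kept = [(label, frac) for label, frac in coverage.items() if frac >= min_fraction]
    kept.sort(key=lambda item: item[1], reverse=True)
    return [label for label, _ in kept]
```

Filtering by coverage keeps the scene description short: small regions such as a lamp are dropped, while surfaces like floor and wall remain.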

bwilkie/Predictive-Computer-Vision