| AppID: | com.rastislavkish.vscan |
| Author: | Rastislav Kish |
| License: | GPL-3.0-only |
| InRepoSince: | 2025-10-18 |
| LastRepoUpdate: | 2025-10-18 |
| LastAppUpdate: | Unknown |
| LastVersion: | 0.2.3 |
| Categories: | AI Chat, Multimedia |
| Google Play: | Check if it's there |
This is a little project of mine aiming to research how vision LLMs could help out blind people on travel and in their every-day life by substituting eyesight for various visual tasks. VScan turns your smartphone's camera into a device for visual perception. You can define various optical cognitive functions, like looking for objects, signs, evaluating a scene or simply mediating visual impressions. You can afterwards use these functions on the camera view, just like a sighted person would use their eyes to achieve a specific goal in the physical world.
Each cognitive tool consists of two major parts:
VScan is open-source software. Visit the project's official repository to learn more about its background, motivation, specific usage examples and setup instructions.
WhatsNew:
- There is now a standalone editor for entering the system prompt and user prompt. This editor has a large text field, which should make it easy to work with long and complex prompts.
- Various UI improvements and bug fixes.
Download (6.6 M)