Skip to content

[✨ FEATURE] Add screen text context read for open apps and/or OCR #448

Description

@benjaminTaubenblatt

Is your feature request related to a problem?
Bad transcription on technical words and function names etc.

Describe the solution you'd like
FluidVoice does an on device screen read, parses text, potentially does OCR and feeds that as context to the recognizer model to produce the right transcription.

Describe alternatives you've considered
Adding to dictionary and instant rules but it is impossible to do it for custom names etc that appear on screen

Additional context
Apparently, VoiceInk does this (proprietary)

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions