Question re add-ons which provide similar functionality to JAWS Picture Smart


Amir
 

Folks, a friend of mine - who's a JAWS user - asked me about NVDA's picture description and analysis capabilities the other day. After a couple of email exchanges I understood that he meant picture analysis/description rather than OCR - something akin to JFW's Picture Smart capabilities. However, my own investigation indicates that, at the time of this writing, NVDA can't provide such functionality via add-ons. I mean it can't take a picture, analyze it via Google/Microsoft/other services, and provide a description. It seems that a few add-ons were developed in the past to take care of this issue, but all of them have apparently been abandoned.
As such, am I correct in assuming that NVDA can't help users with image descriptions? If a solution exists, please kindly keep me in the loop.

Best,
Amir


Luke Davis
 

Amir wrote:

Folks, a friend of mine - who's a JAWS user - asked me about NVDA's picture description and analysis capabilities the other day. After a couple of email exchanges I understood that he meant picture analysis/description rather than OCR - something akin to JFW's Picture Smart capabilities.

I believe that Chromium-based browsers can do this natively now. I don't think Firefox can, but I might be wrong.

NVDA has no facility for it that I know of. I'm really surprised JAWS does, because to my knowledge you have to pay for that API access at an organizational level, unless they just absorb the cost given what they charge for the licenses.

Luke


Stefan Moisei
 

Hi,
see this:
It works, and I use it myself. I think it is slower than Picture Smart, but it gives more detail, since it uses multiple APIs.

------ Original Message ------
From: "Amir" <mobilespace08@...>
Sent: 22.03.2023 23:56:59
Subject: [nvda-addons] Question re add-ons which provide similar functionality to JAWS Picture Smart

Folks, a friend of mine - who's a JAWS user - asked me about NVDA's picture description and analysis capabilities the other day. After a couple of email exchanges I understood that he meant picture analysis/description rather than OCR - something akin to JFW's Picture Smart capabilities. However, my own investigation indicates that, at the time of this writing, NVDA can't provide such functionality via add-ons. I mean it can't take a picture, analyze it via Google/Microsoft/other services, and provide a description. It seems that a few add-ons were developed in the past to take care of this issue, but all of them have apparently been abandoned.
As such, am I correct in assuming that NVDA can't help users with image descriptions? If a solution exists, please kindly keep me in the loop.

Best,
Amir


Amir
 

Thanks, Luke. Apparently, though, not all Chromium-based browsers can do it. It seems that Google Chrome can, but MS Edge doesn't provide such a feature - at least I haven't seen it. And it's absolutely true that Freedom Scientific pays for access to the Google/Microsoft APIs. This is not a free service, after all. It's more or less similar to what they do for access to OmniPage for their OCR requirements.

Best,
Amir


Amir
 

Awesome recommendation, Stefan! Honestly, I thought Cloud Vision had either been abandoned or crippled by API access issues. It works very well here, so I'll recommend it too.
Thanks again.

Best,
Amir