Good video as usual. Is it possible to use these vision models to extract information from passports image or financial statement images? If yes, we need a video for that.
You could extract data from any identification as long the text is visible. It's pretty similar to the part in the video where I extracted data from an image of a table of survey data. You would just need to adjust your prompt and if necessary use structured JSON outputs.
Good video as usual. Is it possible to use these vision models to extract information from passports image or financial statement images? If yes, we need a video for that.
You could extract data from any identification as long the text is visible. It's pretty similar to the part in the video where I extracted data from an image of a table of survey data.
You would just need to adjust your prompt and if necessary use structured JSON outputs.
@OnyxStudiosInteractive Most of LLMs are sensored and personal information in any Identification images are not alloweded to be extracted.
@ I can understand that, they don't want their models to be used to collect sensitive and personal data without user permission.