Released: OCR Pack

Rickard_Abraham · October 13, 2023, 2:52am

Hey everyone,
Today I’m launching the OCR Pack!

It lets you read text in images and PDFs

It’s taken a lot of attempts, research, and experimenting - I’m stoked for this solution and I hope you are too!

Here’s a doc with examples

Scott_Collier-Weir · October 13, 2023, 3:12am

So exciting! Been wanting something like this for a while

You rock!

Christiaan_Huizer · October 13, 2023, 6:51am

Hi @Rickard_Abraham , this could be interesting.
Would you mind creating a video showing some use cases?
Don’t hesitate to share your story in more detail, what looks easy, rarely is.
cheers, Christiaan

Bill_French · October 13, 2023, 11:43am

I can imagine. Some late nights dialing this in I’m sure.

I assume your approach employs some good libraries as opposed to generative AI to accomplish its seemingly accurate extractions? If true, this could be an advantage because many firms are generative AI-averse and try to avoid such dependencies.

Nick_HE · October 14, 2023, 7:23pm

Ooh cool! I’ve been dabbling in a bit of OCR stuff lately too. What do you use under the hood? Tesseract?

Doug_Loud · October 14, 2023, 8:19pm

If this does what I hope it does, it’ll be a real game changer. Thanks!

Rickard_Abraham · October 17, 2023, 12:42am

Thanks everyone for showing such great interest!

I’m using Google’s powerful Document AI and I’m amazed at how quick and accurate it is.

Before discovering this fantastic solution, I experimented with various Python packages on both GPU-less servers and GPU-enabled servers.

If you have any requests for this pack then please let me know!

Rickard_Abraham · October 17, 2023, 2:50am

Implemented better error messages!

Rickard_Abraham · October 17, 2023, 3:07am

Privacy and security

Added the privacy policy with CCPA and GDPR compliance (same as my PDF pack)
API: Document AI Security & Compliance | Google Cloud
Server: How safe and secure is our data on PythonAnywhere? : Forums : PythonAnywhere

Rickard_Abraham · November 30, 2023, 12:15pm

Lowered price to $5 per doc maker

Rickard_Abraham · December 8, 2023, 2:57am

Someone seems to be hitting the api maliciously, had to disable this pack for the time being, sorry about any inconvenience. Happy to refund if you contact me

Had hoped not have to implement a long-term rate limit. Coda has a built-in rate limit but the time limit is 60 seconds, which I’m afraid isn’t long enough:

sisteradmn · January 25, 2024, 12:37pm

Is there a way to implement this just in my doc without a pak? …and not for sharing? Happy to pay for your expertise. — I am just looking for a quicker way to do receipt data entry from a scanned receipt.

Rickard_Abraham · January 25, 2024, 5:10pm

I’m glad you’re reaching out! I’m sure I can revive this privately for you

In the big picture I am striving towards a better permanent public solution for the rate limiting issue, but that will take a while.

Lets continue this conversation in dms

Rickard_Abraham · March 6, 2024, 11:31am

The OCR Pack has now been revived!

A token is now required
Message me privately here to get yours for free
Speaking of free, you can still use the 2 week trial!

I hope this will fix the issue of the server being vulnerable to malicious attacks. In order to deliver the best OCR experience possible to all Coda users I have my own credit card hooked up to use Google’s API. The attack 3 months ago costed me around $80 before I caught it, could’ve been much worse ^^

Ben_Peine · March 22, 2024, 7:21pm

@Rickard_Abraham In you example, you have the produce invoice as one of the PDF’s, which is my exact use case. I’m curious if you have a suggestion to how to use the extracted text, whether through AI or a formula, to capture the price of specific items on the invoice? The way it is extracted in just one column without any structure stumps me at the moment. My dream would be to scan the invoice with the OCR pack and then have the price for those produce items be automatically updated in a separate table.

Rickard_Abraham · March 22, 2024, 9:30pm

Great question @Ben_Peine!

Natural Order

It’s attempting to return the result in a natural reading order, meaning that in most cases it should be possible to construct invoice items thanks to the order (with or without AI).

Specialized Invoice Processor

Beyond this we’re looking at specialized invoice processors, which would require sample invoices provided from you to train it to fit your needs

Pricing Model

I’ve discussed the pricing model with @Christiaan_Huizer which resulted in me now striving towards an external pre/post paid system for extended OCR use!

This would be required before I could provide a specialized processor as they are much more expensive than the general processor I’m currently providing (still talking less than a few cents per page!)

I’ll keep you updated here with my progress, and please let me know what you think

Rickard_Abraham · April 28, 2024, 1:07pm

New formula: `Scan(Media)`

Lets you mix PDFs and Images
Deprecates the original ScanImages and ScanPDFs formulas.

Credits

Each token (message me to get yours!) has 1000 monthly credits.
Each page and image scan costs 1 credit

Rickard_Abraham · May 2, 2024, 3:17pm

New Sync Table: Requests this month

See each request you’ve made this month, how many credits it took, and how many remains

New Formula: RemainingCredits()

Get a simple number for how many credits you have left this month

New output options for Scan()

Plain Text

Original and default output

List of Texts

Use this option to get a list of texts.
Coda currently has a 85 kB limit when writing to a cell, each text will be below this limit, letting you create rows without worrying.
Concatenate() the rows afterwards for the complete scan result

Text File

Get a temporary txt file URL which you can ingest into a File column to store it permanently in Coda.
Name the file with the optional 3:rd filename parameter.
To expose the actual text within this file, use the new ReadTextFile() formula (formulaic columns don’t have the same 85 kB limit)

Rickard_Abraham · May 13, 2024, 2:34pm

New automated payment model

Get your own token with this Coda form
50 free credits are included
Refill it with Stripe using the new BuyCreditsUrl() formula (200 credits per dollar)
The pack is now free!
Updated Requests sync table to include refills:

Christiaan_Huizer · May 13, 2024, 4:31pm

great job @Rickard_Abraham ,
looking forward testing it any time soon!
cheers, christiaan

Topic		Replies	Views
🚀 New Pack Alert: Claude AI with Smart Image Analysis! [BETA] Showcase	0	106	June 15, 2025
Looking for OCR API to Coda Requests and Gigs	3	1307	October 13, 2023
New ChatPDF Pack – Chat to PDF files usign AI Showcase	0	227	April 6, 2024
API for discovering what packs an workspace/document is using Making Packs	0	389	October 14, 2022
Released: PDF Pack Showcase	31	2429	March 4, 2025