Released: OCR Pack

Hey everyone,
Today I’m launching the OCR Pack!

It lets you read text in images and PDFs :partying_face:

It’s taken a lot of attempts, research, and experimenting - I’m stoked for this solution and I hope you are too!

Here’s a doc with examples

13 Likes

So exciting! Been wanting something like this for a while

You rock!

1 Like

Hi @Rickard_Abraham , this could be interesting.
Would you mind creating a video showing some use cases?
Don’t hesitate to share your story in more detail, what looks easy, rarely is.
cheers, Christiaan

2 Likes

I can imagine. Some late nights dialing this in I’m sure. :slight_smile:

I assume your approach employs some good libraries as opposed to generative AI to accomplish its seemingly accurate extractions? If true, this could be an advantage because many firms are generative AI-averse and try to avoid such dependencies.

1 Like

Ooh cool! I’ve been dabbling in a bit of OCR stuff lately too. What do you use under the hood? Tesseract?

1 Like

If this does what I hope it does, it’ll be a real game changer. Thanks!

1 Like

Thanks everyone for showing such great interest!

I’m using Google’s powerful Document AI and I’m amazed at how quick and accurate it is.

Before discovering this fantastic solution, I experimented with various Python packages on both GPU-less servers and GPU-enabled servers.

If you have any requests for this pack then please let me know!

2 Likes

Implemented better error messages!

image image image image

Privacy and security

Lowered price to $5 per doc maker :tada:

1 Like

Someone seems to be hitting the api maliciously, had to disable this pack for the time being, sorry about any inconvenience. Happy to refund if you contact me

Had hoped not have to implement a long-term rate limit. Coda has a built-in rate limit but the time limit is 60 seconds, which I’m afraid isn’t long enough:

2 Likes

Is there a way to implement this just in my doc without a pak? …and not for sharing? Happy to pay for your expertise. :heart: — I am just looking for a quicker way to do receipt data entry from a scanned receipt.

2 Likes

I’m glad you’re reaching out! I’m sure I can revive this privately for you :+1:

In the big picture I am striving towards a better permanent public solution for the rate limiting issue, but that will take a while.

Lets continue this conversation in dms :slight_smile:

3 Likes

The OCR Pack has now been revived!

  • A token is now required
  • Message me privately here to get yours for free
  • Speaking of free, you can still use the 2 week trial!

image


I hope this will fix the issue of the server being vulnerable to malicious attacks. In order to deliver the best OCR experience possible to all Coda users I have my own credit card hooked up to use Google’s API. The attack 3 months ago costed me around $80 before I caught it, could’ve been much worse ^^

5 Likes

@Rickard_Abraham In you example, you have the produce invoice as one of the PDF’s, which is my exact use case. I’m curious if you have a suggestion to how to use the extracted text, whether through AI or a formula, to capture the price of specific items on the invoice? The way it is extracted in just one column without any structure stumps me at the moment. My dream would be to scan the invoice with the OCR pack and then have the price for those produce items be automatically updated in a separate table.

1 Like

Great question @Ben_Peine!

Natural Order

It’s attempting to return the result in a natural reading order, meaning that in most cases it should be possible to construct invoice items thanks to the order (with or without AI).

Specialized Invoice Processor

Beyond this we’re looking at specialized invoice processors, which would require sample invoices provided from you to train it to fit your needs

Pricing Model

I’ve discussed the pricing model with @Christiaan_Huizer which resulted in me now striving towards an external pre/post paid system for extended OCR use!

This would be required before I could provide a specialized processor as they are much more expensive than the general processor I’m currently providing (still talking less than a few cents per page!)


I’ll keep you updated here with my progress, and please let me know what you think :slight_smile:

1 Like

New formula: Scan(Media)

Lets you mix PDFs and Images :ok_hand:
Deprecates the original ScanImages and ScanPDFs formulas.

Credits

Each token (message me to get yours!) has 1000 monthly credits.
Each page and image scan costs 1 credit

3 Likes

New Sync Table: Requests this month

See each request you’ve made this month, how many credits it took, and how many remains

image

New Formula: RemainingCredits()

Get a simple number for how many credits you have left this month

New output options for Scan()

Plain Text

Original and default output

List of Texts

Use this option to get a list of texts.
Coda currently has a 85 kB limit when writing to a cell, each text will be below this limit, letting you create rows without worrying.
Concatenate() the rows afterwards for the complete scan result

Text File

Get a temporary txt file URL which you can ingest into a File column to store it permanently in Coda.
Name the file with the optional 3:rd filename parameter.
To expose the actual text within this file, use the new ReadTextFile() formula (formulaic columns don’t have the same 85 kB limit)

2 Likes

New automated payment model

  • Get your own token with this Coda form
  • 50 free credits are included
  • Refill it with Stripe using the new BuyCreditsUrl() formula (200 credits per dollar)
  • The pack is now free!
  • Updated Requests sync table to include refills:

image

3 Likes

great job @Rickard_Abraham ,
looking forward testing it any time soon!
cheers, christiaan

1 Like