Searchable PDF (or keyword extraction)

Patricia_Hoffmann · May 16, 2023, 7:05am

Hi!

A team of my company is thinking about moving to coda.io to set up a product management database with a lot of pdf files…

However, they are currently tending to go with Sharepoint as PDF files uploaded there are searchable.
You can just type in a keyword and all the PDFs that you need are searched through.
I know this question has already been asked before in a similar way (Search within PDF and Images (OCR)), but I wanted to ask if

there is any (simple) workaround for this
@coda_account Is planning on implementing such a feature in the future (as they will go with coda.io then and just wait)
If there is a way to automatically at least extract keywords from a PDF in coda?

Thanks for any hint

Rickard_Abraham · May 16, 2023, 12:17pm

Hey! Depending on the complexity and encodings of the PDFs you might have some luck adding a column to simply read the file contents, which Coda search happily picks up on

PDF decoding is hard. To do this properly I’d try finding a node package that parses PDFs to use in a custom Coda pack

Patricia_Hoffmann · May 17, 2023, 9:35am

Thank you so much for your answer!
I tried it, but it didn’t turn out so well either

Rickard_Abraham · May 17, 2023, 12:42pm

Haha yeah not ideal, worth a try! If we’re lucky someone more knowledgable has an easy fix to make a few more words appear, something like configuring the encoding when reading the file in the coda pack perhaps

Rickard_Abraham · July 12, 2023, 6:38pm

Good news! Today I released a pack that can read PDFs

system · October 10, 2023, 6:39pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Search within PDF and Images (OCR) Suggestion Box	17	1793	October 13, 2023
Make PDFs in a table searchable Marketplace	6	925	July 12, 2023
Can Coda AI do this? (Images, PDFs, files...)	5	309	August 15, 2024
Coda AI Point to PDFs	12	995	October 11, 2023
PDF Extract Pack Showcase	5	706	January 16, 2024

Searchable PDF (or keyword extraction)

Related topics