From: rhaddan on
I have a document management application that stores data about PDF documents
in a database. Each record references the file name used to access the PDF, like:

id  Name         Number   URL
1   Doc Title 1  100-199  100-199.pdf
2   Doc Title 2  100-287  100-287.pdf

We currently search on the name and number from the database, but users want
to search the contents of the PDFs as well. The name and number appear inside
each PDF, but the files carry no metadata, so a search of the files alone
cannot return clean results.

Is there a way to combine the database query collection with the file search
collection to create a single, searchable collection without manipulating the
PDF files?
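
To make the goal concrete, here is a rough Python sketch of the behavior I am after. It is not tied to any particular search engine. The table name `documents`, the helper names `build_index` and `search`, and the sample body text are all made up for illustration, and the PDF text is assumed to have been extracted already by some other tool. The idea is that each database row is joined to its PDF's text by the URL column, so a hit on the contents still comes back with the clean name and number from the database.

```python
import sqlite3


def build_index(db_rows, pdf_text_by_url):
    """Join each database row to the text of its PDF, keyed by the URL column."""
    index = []
    for doc_id, name, number, url in db_rows:
        index.append({
            "id": doc_id,
            "name": name,
            "number": number,
            "url": url,
            # Hypothetical: text already extracted from the PDF by another tool.
            "body": pdf_text_by_url.get(url, ""),
        })
    return index


def search(index, term):
    """Case-insensitive substring search over name, number, and PDF body."""
    term = term.lower()
    hits = []
    for rec in index:
        haystack = " ".join((rec["name"], rec["number"], rec["body"])).lower()
        if term in haystack:
            # Return the clean database fields, not raw PDF text.
            hits.append((rec["id"], rec["name"], rec["number"], rec["url"]))
    return hits


# In-memory stand-in for the real database (table name is an assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE documents (id INTEGER PRIMARY KEY, name TEXT, number TEXT, url TEXT)"
)
conn.executemany(
    "INSERT INTO documents VALUES (?, ?, ?, ?)",
    [
        (1, "Doc Title 1", "100-199", "100-199.pdf"),
        (2, "Doc Title 2", "100-287", "100-287.pdf"),
    ],
)
rows = conn.execute(
    "SELECT id, name, number, url FROM documents ORDER BY id"
).fetchall()

# Made-up sample text standing in for the extracted PDF contents.
extracted = {
    "100-199.pdf": "Doc Title 1 100-199 pump maintenance schedule",
    "100-287.pdf": "Doc Title 2 100-287 valve inspection checklist",
}

index = build_index(rows, extracted)
print(search(index, "valve"))
```

The PDF files themselves are never modified here; only the extracted text is combined with the database fields at index time.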