Upload Document
Uploading a document takes two steps. When you click the Upload Document button, a file selector opens so you can choose a local document to upload. Supported file types include .docx, .doc, .pdf, .txt, .csv, .json, and .xlsx.
If a PDF is uploaded but yields no text content, mAIstro automatically runs OCR on it.
After the document is successfully uploaded, it is available in the Upload Document pane.
The uploaded document can then be used with the following syntax:
{{ doc|name:output.csv }}
Parameters
- File Upload: The file to be processed.
Returns
- The plain text of the document. If an image-based PDF is uploaded and the scraper returns no text, the OCR'd text of the document is returned automatically.
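The fallback described above (try the embedded text first, use OCR only when nothing comes back) can be sketched in Python. This is an illustration of the behavior, not mAIstro's implementation: `document_text`, `extract_text`, and `run_ocr` are hypothetical names, and the two callables stand in for whatever scraper and OCR engine are actually used.

```python
from typing import Callable


def document_text(
    data: bytes,
    extract_text: Callable[[bytes], str],
    run_ocr: Callable[[bytes], str],
) -> str:
    """Return the document's embedded text, or OCR'd text if none is found.

    Both callables are hypothetical stand-ins for a text scraper and an
    OCR engine; they are not part of any mAIstro API.
    """
    text = extract_text(data)
    # Whitespace-only output is treated as "no text", which is what an
    # image-based PDF would typically produce.
    if text.strip():
        return text
    return run_ocr(data)


# Stand-in callables for illustration only.
text_pdf = document_text(b"...", lambda d: "Invoice 42", lambda d: "ocr text")
scanned_pdf = document_text(b"...", lambda d: "  \n", lambda d: "ocr text")
```

Here `text_pdf` keeps the scraped text, while `scanned_pdf` falls through to the OCR result because the scraper returned only whitespace.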
OCR an Image
mAIstro’s OCR feature automatically processes image-based PDFs and images, converting them into searchable, editable text.
OCR'ing a document takes two steps. When you click the OCR an Image button, a file selector opens so you can choose a local document to upload. Supported file types include .pdf, .png, and .jpeg.
After the document is successfully uploaded, it is available in the Upload Document pane.
The uploaded document or image can then be used with the following syntax:
{{ doc|name:screenshot_2024-11-05.png }}
Parameters
- File Upload: PDF or image file to be processed with OCR.
Returns
- The plain text of the document. If an image-based PDF is uploaded and the scraper returns no text, the OCR'd text of the document is returned automatically.
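Because the two upload options accept different file types, it can help to check a file's extension before choosing one. The sketch below uses only the extensions named in this page, which the text presents as partial lists. `upload_options` is a hypothetical helper, and the case-insensitive comparison is an assumption rather than documented mAIstro behavior.

```python
from pathlib import Path
from typing import List

# Extensions named in this page; the page describes both lists as partial.
UPLOAD_DOCUMENT_TYPES = {".docx", ".doc", ".pdf", ".txt", ".csv", ".json", ".xlsx"}
OCR_IMAGE_TYPES = {".pdf", ".png", ".jpeg"}


def upload_options(filename: str) -> List[str]:
    """List the upload options whose documented file types match filename."""
    # Assumption: extensions are compared case-insensitively.
    suffix = Path(filename).suffix.lower()
    options = []
    if suffix in UPLOAD_DOCUMENT_TYPES:
        options.append("Upload Document")
    if suffix in OCR_IMAGE_TYPES:
        options.append("OCR an Image")
    return options
```

For example, `upload_options("example.pdf")` lists both options, `upload_options("output.csv")` lists only Upload Document, and `upload_options("screenshot_2024-11-05.png")` lists only OCR an Image.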
Example 1: Using OCR with PDF Files
- Go to Upload Data in mAIstro.
- Select Upload Document or OCR an Image and choose your PDF file.
- OCR will be automatically applied, transforming the document into searchable text.
NTL Snippet:
{{ doc | name: "example.pdf" }}
{{ LLM | prompt: "List names in this document:" | cache: "true" }}
Returns
This example returns a text-rich version of example.pdf, with names extracted as specified in the prompt.
Example 2: Using OCR with Image Files
- Go to Upload Data in mAIstro.
- Select OCR an Image to upload an image file.
- OCR processing starts automatically, converting the image into searchable text.
NTL Snippet:
{{ doc | name: "image.png" }}
{{ LLM | prompt: "List names in this document:" | cache: "true" }}
Returns
This example returns a text-rich version of image.png, optimized for data extraction and analysis.
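Both examples pair a doc tag with an LLM tag and differ only in the file name. When the same snippet is needed for many uploaded files, a small Python helper can fill in the pattern. This is a sketch: `ntl_snippet` is a hypothetical helper, not part of mAIstro, and it only reproduces the tag layout shown in the two examples. It assumes the file name and prompt contain no double quotes.

```python
def ntl_snippet(filename: str, prompt: str, cache: bool = True) -> str:
    """Build the two-line snippet used in the examples for a given file.

    Hypothetical helper: it copies the tag layout from the examples and
    performs no escaping of the file name or prompt.
    """
    doc_tag = '{{ doc | name: "%s" }}' % filename
    # The examples pass cache as the quoted string "true".
    llm_tag = '{{ LLM | prompt: "%s" | cache: "%s" }}' % (prompt, str(cache).lower())
    return doc_tag + "\n" + llm_tag


snippet = ntl_snippet("image.png", "List names in this document:")
print(snippet)
```

The printed text matches the NTL snippet from Example 2, and changing the first argument to "example.pdf" gives the snippet from Example 1.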