You are tasked with using the 'PARSE DOCUMENT' function in Snowflake to extract key information (name, address, phone number) from a large collection of scanned invoices stored as PDF files in an AWS S3 bucket. The invoices have varying formats and quality. Which of the following approaches would be MOST effective to structure the extracted data for analysis?

Question

Accepted Answer

C. Create a custom UDF (User-Defined Function) that calls 'PARSE_DOCUMENT and then uses

Answer

A. Use `PARSE DOCUMENT with default settings and load the raw JSON output into a VARIANT

Answer

B. Use `PARSE DOCUMENT with a pre-defined JSON schema to enforce a rigid structure on the

Answer

D. Employ a combination of 'PARSE DOCUMENT and Snowflake's external functions to integrate

Answer

E. Directly load PDF files into a relational table's TEXT column and write SQL queries utilizing LIKE

You are tasked with using the 'PARSE DOCUMENT' function in Snowflake to extract key information (name, address, phone number) from a large collection of scanned invoices stored as PDF files in an AWS

Question

Options

How the community answered

Explanation

Topics

Community Discussion