AI-103 Certification Practice Question #54

Question

You have a Microsoft Foundry project that contains an agent.
The agent uses a knowledge source built from documents stored in Azure Blob Storage. The documents include digitally scanned PDFs that contain multipage tables.
You have an ingestion job that extracts only plain text, causing loss of table structure, headings, and page-number metadata.
Users frequently ask questions that require the retrieval of specific table rows across the pages.
You need to configure an ingestion job for a Retrieval Augmented Generation (RAG) pipeline that performs optical character recognition (OCR) on scanned PDFs, preserves tables and headings as structure-aware chunks, and stores page-number metadata with each chunk.
How should you configure the ingestion job?

Accepted Answer

The job must run OCR on scanned PDFs, produce structure-aware chunks that keep tables and headings intact, and store page-number metadata with each chunk.

C and D can be ruled out: a single chunk per page destroys row-level structure, and "basic + fixed-size chunking" is exactly the setup being replaced. The real contest is between A (advanced data parsing) and B (OCR and page-level chunking).

The Suggested Answer is B. "OCR and page-level chunking" names OCR explicitly, which is mandatory for scanned PDFs, and page-level processing is what allows page-number metadata to be attached to each chunk. However, the MS Learn description of "advanced data parsing" matches the requirement wording almost verbatim: it preserves headings and tables, merges cross-page tables, and stores headings and page numbers. These two high-weight signals (the Suggested Answer and the MS Learn feature description) genuinely conflict, and there is no community vote to break the tie, so confidence in B is lowered to 70%.

I report B per the official Suggested Answer, but flag this question as ambiguous.
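
To make "structure-aware chunks with page-number metadata" concrete, here is a minimal Python sketch of the idea. It is not the Microsoft Foundry API or any real ingestion configuration. The names Element, Chunk, and chunk_elements are hypothetical, and the input is assumed to be the already-parsed output of some OCR/layout step. The sketch keeps table rows together even when they continue on the next page, tags each chunk with its nearest heading, and records every page the chunk touches.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """One layout element, as a hypothetical OCR/layout step might emit it."""
    kind: str  # "heading", "paragraph", or "table_row"
    text: str
    page: int


@dataclass
class Chunk:
    kind: str      # "text" or "table"
    heading: str   # nearest preceding heading, "" if none
    lines: list = field(default_factory=list)
    pages: list = field(default_factory=list)  # pages touched, in order


def chunk_elements(elements):
    """Group elements into chunks; table rows stay together across pages."""
    chunks = []
    heading = ""
    current = None
    for el in elements:
        if el.kind == "heading":
            heading = el.text
            current = None  # a heading closes whatever chunk was open
            continue
        kind = "table" if el.kind == "table_row" else "text"
        if current is None or current.kind != kind:
            current = Chunk(kind=kind, heading=heading)
            chunks.append(current)
        current.lines.append(el.text)
        if el.page not in current.pages:
            current.pages.append(el.page)  # page-number metadata per chunk
    return chunks


# Illustrative input: a table that starts on page 1 and continues on page 2.
elements = [
    Element("heading", "Q3 Revenue", 1),
    Element("paragraph", "Figures are in USD.", 1),
    Element("table_row", "Region | Revenue", 1),
    Element("table_row", "EMEA | 120", 1),
    Element("table_row", "APAC | 95", 2),
    Element("heading", "Notes", 2),
    Element("paragraph", "APAC excludes Japan.", 2),
]
chunks = chunk_elements(elements)
for c in chunks:
    print(c.kind, c.heading, c.pages, len(c.lines))
```

In this sketch the table becomes a single chunk whose page list is [1, 2], so a question about a row on page 2 still retrieves the header row from page 1. A fixed-size or one-chunk-per-page split would separate the two.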
