For buyers (contracting authorities)
Pick 72312200 when automatic text recognition is the point of the contract, not manual keying. The cleanest test: if a machine reads the characters off a scanned image and outputs text, this is the code; if staff transcribe the data by hand, the parent Data entry services (72312000) fits better.
The boundary that trips contracting authorities is the line between scanning and recognition. Capturing the image is Scanning services (79999100); converting that image into machine-readable characters is OCR. Most real digitisation tenders bundle both, so set the primary CPV code by the dominant deliverable and tag the secondary scope as a supporting code.
The sibling to keep in view is Data preparation services (72312100), which covers cleaning, formatting and structuring data once it exists. Where a contract folds scanning, OCR and downstream structuring into one award, classify by whichever stage carries the most weight in the statement of work rather than splitting hairs across three leaves.
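The dominant-deliverable rule can be sketched in a few lines of Python. This is a minimal illustration, not an official classification tool: the stage names, the `classify` function, and the idea of expressing each stage as a numeric share of the statement of work are all assumptions made for the example. Only the four CPV codes come from the text above.

```python
# Hypothetical stage names mapped to the CPV codes discussed above.
STAGE_TO_CPV = {
    "manual_data_entry": "72312000",  # Data entry services (parent)
    "data_preparation": "72312100",   # Data preparation services
    "ocr": "72312200",                # OCR (automatic text recognition)
    "scanning": "79999100",           # Scanning services
}


def classify(stage_weights: dict[str, float]) -> tuple[str, list[str]]:
    """Return (primary_code, supporting_codes) for a bundled contract.

    stage_weights maps a stage name to its assumed share of the statement
    of work (for example by value or effort). The heaviest stage sets the
    primary code; every other stage with a non-zero share becomes a
    supporting code, ordered from heaviest to lightest.
    """
    unknown = set(stage_weights) - set(STAGE_TO_CPV)
    if unknown:
        raise ValueError(f"unknown stages: {sorted(unknown)}")

    # Keep only stages that are actually in scope, heaviest first.
    in_scope = sorted(
        ((stage, weight) for stage, weight in stage_weights.items() if weight > 0),
        key=lambda item: item[1],
        reverse=True,
    )
    if not in_scope:
        raise ValueError("at least one stage must have a positive weight")

    codes = [STAGE_TO_CPV[stage] for stage, _ in in_scope]
    return codes[0], codes[1:]


# Example: a digitisation tender where recognition dominates.
primary, supporting = classify(
    {"scanning": 0.3, "ocr": 0.5, "data_preparation": 0.2}
)
print(primary, supporting)
```

In this sketch a tie between two stages is resolved by input order, which is arbitrary; in practice a tie is a signal to reread the statement of work rather than let a script decide.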