When testing Document Processing, use the sample PDFs below. Upload these documents to test successful parsing, various tax form types, and suspicious document detection.
The sandbox determines each test scenario from the file name alone and ignores the actual file contents.
Pay stubs
| Document | Download |
|---|---|
| Most recent paystub | most.recent.paystub.pdf |
| Next recent paystub | next.recent.paystub.pdf |
| First paystub | first.paystub.pdf |
Tax documents
| Document | Download |
|---|---|
| W-2 | w2.pdf |
| 1099-DIV | 1099div.pdf |
| 1099-G | 1099g.pdf |
| 1099-INT | 1099int.pdf |
| 1099-MISC | 1099misc.pdf |
| 1099-NEC | 1099nec.pdf |
| 1099-R | 1099r.pdf |
| SSA-1099 | ssa1099.pdf |
| 1040 | f1040.pdf |
For 1099 tax documents, Truv supports parsing formats from any year after 2021. This includes 1099-DIV, 1099-G, 1099-INT, 1099-MISC, 1099-NEC, and 1099-R.
Volunteer documents
| Document | Download |
|---|---|
| Volunteer letter | volunteer_letter.pdf |
| Volunteer timesheet | volunteer_timesheet.pdf |
Suspicious document detection
| Scenario | Description | Downloads |
|---|---|---|
| Tampered documents | Information is falsified or manipulated | Tampered 1, Tampered 2, Tampered 3 |
| Different SSNs | Personal information is inconsistent across documents | SSN 1, SSN 2, SSN 3 |
| Different applicant names | Personal information is inconsistent across documents | Applicant 1, Applicant 2, Applicant 3 |
| No data or invalid data | Information is missing or unable to be parsed | No data 1, No data 2, No data 3 |
Make sure the file name starts with the "suspicious." prefix before uploading. See the tip at the top of this page for the full rule and examples.
Validation failures
Use these files to test the file-level validation flags returned by the Document Collections API. Each file triggers a specific validation failure on the collection retrieve response. In the sandbox, only the file name determines the test scenario; the actual file contents are ignored. You can upload any PDF as long as the filename matches.

| Document | Triggers | Download |
|---|---|---|
| Duplicate file | is_unique: false — upload any file twice in the same collection, or use the provided file | duplicate.pdf |
| Unreadable file | is_readable: false | unreadable.pdf |
| Password-protected file | is_accessible: false (password: paystub123) | password.pdf |
| Unsupported MIME type | is_supported_type: false | unsupported_type.pdf |
| Processing failure | status: failed after validation passes | failed.pdf |
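
The flags in the table can be checked programmatically once a collection has been retrieved. The following is a minimal sketch, assuming each file in the retrieve response is available as a dictionary carrying the flags named above. The helper name `failed_validations` and the response shape are assumptions for illustration, not part of the documented API.

```python
from typing import Any

# Validation flags listed in the table above; a value of False signals a failure.
VALIDATION_FLAGS = ("is_unique", "is_readable", "is_accessible", "is_supported_type")


def failed_validations(file_entry: dict[str, Any]) -> list[str]:
    """Return the names of the checks that a file entry failed.

    Assumes (hypothetically) that file_entry is a dict holding the boolean
    flags from the table and an optional "status" string. Flags that are
    absent are treated as passing.
    """
    failures = [flag for flag in VALIDATION_FLAGS if file_entry.get(flag) is False]
    # failed.pdf passes validation but ends with a failed processing status.
    if file_entry.get("status") == "failed":
        failures.append("status")
    return failures


# Example entry shaped like the assumed response for unreadable.pdf.
entry = {
    "is_unique": True,
    "is_readable": False,
    "is_accessible": True,
    "is_supported_type": True,
}
print(failed_validations(entry))  # ['is_readable']
```

Treating a missing flag as passing keeps the helper tolerant of partial responses. Adjust that choice if the real response always includes every flag.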
Base64 encoding for Document Collections API
The Document Collections API accepts base64-encoded file content when creating or uploading to a collection. Encode the test document as base64, then pass the encoded string in the content field when creating a document collection.
In sandbox mode, the file name determines the test scenario, not the actual content, so the base64 content can come from any valid PDF.
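
As a rough illustration of the encoding step, the sketch below base64-encodes a PDF with Python's standard library and places the result in a content field. Only the content field and the sandbox file names come from this page. The `file_name` key, the helper name `build_document_entry`, and the overall entry shape are assumptions, so check the API reference for the actual request schema.

```python
import base64
from pathlib import Path


def build_document_entry(pdf_path: str, sandbox_name: str) -> dict[str, str]:
    """Base64-encode a local PDF and pair it with a sandbox file name.

    The "file_name" key is a hypothetical placeholder; only "content" is
    named in the documentation above.
    """
    raw = Path(pdf_path).read_bytes()
    # b64encode returns bytes; decode to str so the value is JSON-serializable.
    encoded = base64.b64encode(raw).decode("ascii")
    return {"file_name": sandbox_name, "content": encoded}
```

For example, `build_document_entry("any.pdf", "unreadable.pdf")` would stage an arbitrary local PDF under the name that triggers the unreadable-file scenario in the sandbox.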