Add a note about FPDFText_GetText() behavior.

Explain that the returned results contain characters outside the
cropbox, and suggest some APIs to call to determine if that is the case.

Bug: pdfium:1842
Change-Id: I2ac79668184f5313a14a3d3d6f3f4a8eda06d678
Reviewed-on: https://pdfium-review.googlesource.com/c/pdfium/+/95251
Reviewed-by: Nigi <nigi@chromium.org>
Commit-Queue: Lei Zhang <thestig@chromium.org>
diff --git a/public/fpdf_text.h b/public/fpdf_text.h
index 6d4a020..65604d8 100644
--- a/public/fpdf_text.h
+++ b/public/fpdf_text.h
@@ -341,6 +341,10 @@
 //          trailing terminator.
 // Comments:
 //          This function ignores characters without unicode information.
+//          It returns all characters on the page, even those that are not
+//          visible when the page has a cropbox. To filter out the characters
+//          outside of the cropbox, use FPDF_GetPageBoundingBox() and
+//          FPDFText_GetCharBox().
 //
 FPDF_EXPORT int FPDF_CALLCONV FPDFText_GetText(FPDF_TEXTPAGE text_page,
                                                int start_index,