I want to use PDF/TIFF Document Text Detection service from google cloud [closed]

问题

Google cloud vision api is very powerful and now they have support for pdf format, but the documentation is getting me confused, can someone pls guide a noob how to set up and process a pdf file using vision api.

kind of like starter tutorial

ref: https://cloud.google.com/vision/docs/pdf#vision-web-detection-gcs-protocol

The confusion is how to pass the command arguments, and send my sample file and retrieve the results in csv or json format

Should i use my windows command line or cloudshell on google cloud

There is good starter reference for other services , if you look at this https://cloud.google.com/vision/docs/ocr#vision-detect-labels-gcloud, they have clearly given commands to use in different envs

回答1:

You could use the following curl command as a base and run it within your Google Cloud Shell session:

curl -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) --header "Content-Type: application/json"   --request POST   --data '{ "requests":[  {"inputConfig": { "gcsSource": {  "uri": "gs://<Your Bucket>/<Your PDF FIle>"},"mimeType":"application/pdf"},"features": [{"type":"DOCUMENT_TEXT_DETECTION"}],"outputConfig": {"gcsDestination":{"uri": "gs://<Your destination bucket/"},"batchSize": 2}}]}' https://vision.googleapis.com/v1/files:asyncBatchAnnotate

Then, you'll get an operation object as a response like this: "name": "operations/<This is your operation ID>".

After that, you'll get your results with the GET request, for instance:

curl --request GET https://vision.googleapis.com/v1/operations/<The Operation ID>?key=<Your API Key>

来源：https://stackoverflow.com/questions/52614091/i-want-to-use-pdf-tiff-document-text-detection-service-from-google-cloud

标签

google-cloud-vision