cancel
Showing results for 
Search instead for 
Did you mean: 
Read only

PDF as Source in Text Data Processing

Former Member
0 Likes
530

Hi All,

I have read that Text Data Processing supports pdf,word and other binary formats but i dont understand how use the pdf/word as source.

Can anyone explain or guide me a work around.

Thanks,

srinivas

View Entire Topic
former_member187605
Active Contributor
0 Likes

In the file format definition set Type to Unstructured Text. The output schema will look like this:

Then use a TDP Entity_Extraction transform to process te contents.