Cause:
These documents are most likely emails with meta data only. If an email does not have a body, they have meta data only, then the 'Has Extracted Text' field will be 'No'. Sometimes Venio can only access the metadata of a file such as the author, the sender and recipient of an email, etc... but it cannot access the text of the body, or if the file is an unsent empty draft, there is no text to access.
The 'Extracted Text' viewer will take the metadata from an email message (such as the sender, recipient, subject, etc.) and combine it to form a text representation of the email.
Exporting Email Headers Generated From Metadata
If you want to produce/export the text representations of the metadata generated by the 'Extracted Text' viewer, select the option 'If the email body is empty, produce text file with email header only'. This will save the text generated by combining the metadata fields of the email into a single extracted text file. Screenshot attached for reference. 
Production Settings --> Full text --> Option
Summary
In cases where the email body cannot be accessed, the 'Has Extracted Text' field will be marked as 'No' even though the 'Extracted Text' within the Full Text viewer is populated. This is most likely in the case of emails which have meta-data only.
Comments
0 comments
Please sign in to leave a comment.