Reputation: 17
I have gone through the 'https://docs.marklogic.com/guide/cpf/default' to set up the pipeline to make the binary document searchable. i can observe that .xml and .xhtml are being generated out of ingested file. when i tried searching using Java Client API search query, i got the results from generated xml file rather than getting the results from ingested file.
Please let me know how can i get the uri of the ingested document while performing search using Java Client API search query. Since i want to display the actual document's content so DOC uri would be useful in that.
Upvotes: 0
Views: 57
Reputation: 4912
Only the contents of the conversion products is indexed: the binary source file remains binary and unindexed, which is why it won't show up in your searches. If you want to get back to it, you can fetch the properties of the .xhtml document -- they will include a link property identifying the source URI. Or you can reverse engineer that URI from the URI of the .xhtml document -- the default conversion application uses a consistent and fairly simple naming scheme to generate those URIs.
Upvotes: 1