How to load & query word/excel documents in Marklogic Server?

Nickest

New Member
I want to load MS Office word/excel documents into Marklogic and want to query them with xquery as done with xml documents. But when I load doc files into Marklogic it loads them into binary format and shows junk characters when viewed with query console. I tried loading with following command:-\[code\]xdmp:document-load("E:\doc\sample.doc", <options xmlns="xdmp:document-load" xmlns:http="xdmp:http"> <format>xml</format> </options>)\[/code\]But it shows an error that says document is not UTF 8 Encoded. I want to know whether doc and xls files can be loaded as it is into Marklogic or they have to be converted to xml or UTF 8 Encoded format before loading them. If yes, then what is the process of converting them. If no, then how can we query them with xquery. I also want to know if MS Office 2007/2010 installation is necessary for the conversion process because both Office 2007 & 2010 support OOXML format.Please give me proper guidance about this.
 
Back
Top