Say one has a collection of docs:

doc1
<para><sentence><bold>That</bold></sentence></para>

doc2
<para><sentence><bold>That</bold></sentence></para>

doc3
<paragraph><strong>That</strong></paragraph>

... doc20000 (many docs)

I am looking for a solution (application, ideas, designs) that would return:

1. A listing of XPaths to elements:
   para
   para/sentence
   paragraph/strong
OR
2. A schema from the docs in a collection.
OR
3. Other ideas?
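
To make option 1 concrete, here is a minimal sketch of what such a listing could look like, assuming the docs are well-formed XML small enough to parse one at a time with Python's standard `xml.etree.ElementTree`. The function names `element_paths` and `collection_paths` are hypothetical, and the in-memory `docs` list stands in for however the real collection is stored.

```python
import xml.etree.ElementTree as ET


def element_paths(xml_text):
    """Return the set of slash-separated element paths found in one document."""
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        # Build the path from the root down to this element.
        path = f"{prefix}/{node.tag}" if prefix else node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def collection_paths(docs):
    """Union the element paths over an iterable of XML strings, sorted."""
    all_paths = set()
    for xml_text in docs:
        all_paths |= element_paths(xml_text)
    return sorted(all_paths)


# Hypothetical stand-in for the real collection (doc1 .. doc3 from the question).
docs = [
    "<para><sentence><bold>That</bold></sentence></para>",
    "<para><sentence><bold>That</bold></sentence></para>",
    "<paragraph><strong>That</strong></paragraph>",
]

for path in collection_paths(docs):
    print(path)
```

For the three sample docs this prints every distinct path, including the root elements and `para/sentence/bold`. The paths are relative to each document's root and carry no positional predicates, so repeated siblings collapse into one entry. Namespaced elements would appear in ElementTree's `{uri}local` form. A listing like this could also serve as raw input for inferring a schema (option 2), though that step is not shown here.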