I have a project which involves a large number (about 40,000) Microsoft Word documents which are massively hyperlinked to an even larger number (about 60,000) of PDF files which themselves contain many hyperlinks to local PDF files. When I try to do text searches on these files after trimming the number of files down enough to get under the limits I mentioned in another post, I discovered that while the hyperlinks in PDF files appear as plain text in the search results, the display text for the hyperlinks in the word documents have underscores inserted at the beginning, at the end and between every word in the text making it much harder to search. If it necessary for those underscores to be present? Could they be made optional?
AT is on the way to becoming as useful as Google desktop was but some issues like this are still standing in the way.