Flexible Indexing and Site Integration
IT.com is able to crawl and index data from any data source published via HTTP or HTTPS (secure HTTP). This includes "dynamic" data sources that publish content from databases (such as Oracle, MS SQL Server, MySQL, Sybase, etc.), file servers, and content management systems. This also includes external, non-government websites.
We also provide comprehensive document search capabilities. IT.com can retrieve and index any file format commonly published to the web, including:
- HTML
- PDF and Postscript
- XML
- TXT and RTF
- Microsoft PowerPoint
- Microsoft Word
- Microsoft Excel
We can also index bundles of files published as archived formats like zip, tar, and gzip.
