Pythagoria's crawling service harvests content from a file system (more than 100 target file formats managed), by querying a DBMS, a CMS (MS SharePoint, Drupal, Alfresco, WordPress and others), or a mail server. It can also automatically collect external content on HTTP/HTTPS servers, RSS / Atom feeds.