TomHH wrote:What is the difference between active and passive crawling?
In active crawling, every peer acts like a normal crawler: it downloads and indexes web pages autonomously and traverses the web by following links, all independent of user behaviour.
The algorithm ensures that different peers have almost no overlap in the pages they crawl, even though each peer crawls autonomously and independently.
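The algorithm FAROO actually uses is not spelled out in this thread. Purely as an illustration of how independent peers can avoid overlap without exchanging URLs, here is a minimal sketch that assumes a hash-based split of the URL space. The names `owner_bucket`, `should_crawl` and `NUM_BUCKETS` are hypothetical and not taken from FAROO.

```python
import hashlib

NUM_BUCKETS = 4  # hypothetical: number of partitions of the URL space


def owner_bucket(url: str, num_buckets: int = NUM_BUCKETS) -> int:
    """Map a URL to a bucket deterministically, so every peer computes the same result."""
    digest = hashlib.sha256(url.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def should_crawl(url: str, my_bucket: int, num_buckets: int = NUM_BUCKETS) -> bool:
    """A peer follows a discovered link only if the URL hashes into its own bucket."""
    return owner_bucket(url, num_buckets) == my_bucket
```

In such a scheme each URL belongs to exactly one bucket, so two peers assigned to different buckets never crawl the same page, even though neither tells the other what it is doing.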
In passive crawling, only web pages that the user has previously visited in the browser are indexed. Within the crawler queue, passive crawling has priority.
This ensures that the pages currently receiving users' attention are indexed first.
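To make the priority rule concrete, here is a small sketch of a crawler queue in which passive entries are always served before active ones. It is an assumption about how such a queue could look, not FAROO's code. `CrawlQueue`, `PASSIVE` and `ACTIVE` are made-up names.

```python
import heapq
import itertools

PASSIVE = 0  # pages the user visited in the browser (served first)
ACTIVE = 1   # pages discovered by following links


class CrawlQueue:
    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # keeps FIFO order within one priority

    def push(self, url, source):
        # Lower source value means higher priority; the counter breaks ties.
        heapq.heappush(self._heap, (source, next(self._counter), url))

    def pop(self):
        return heapq.heappop(self._heap)[2]

    def __len__(self):
        return len(self._heap)


queue = CrawlQueue()
queue.push("http://example.com/linked", ACTIVE)
queue.push("http://example.com/visited", PASSIVE)
print(queue.pop())  # http://example.com/visited
```

Even though the actively discovered link was queued first, the page the user visited comes out first.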
TomHH wrote:Can I prevent faroo to accept urls for crawling from other peers?
FAROO does not accept URLs for crawling from other peers at all. Every peer crawls autonomously.
TomHH wrote:If both options are switched off: Is the web history still indexed?
There is one more thing to keep in mind: if crawling is disabled, the crawler queue is no longer filled with new entries, but entries still in the queue will be processed.
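The behaviour described here (no new entries, but the backlog is still worked off) can be sketched roughly as follows. This is only a model of the described behaviour under my own assumptions. The `Crawler` class and its attribute names are hypothetical.

```python
from collections import deque


class Crawler:
    def __init__(self):
        self.queue = deque()
        self.crawling_enabled = True

    def enqueue(self, url):
        """Add a URL only while crawling is enabled; return whether it was accepted."""
        if not self.crawling_enabled:
            return False
        self.queue.append(url)
        return True

    def process_next(self):
        """Process the oldest queued entry, regardless of the enabled flag."""
        if not self.queue:
            return None
        return self.queue.popleft()
```

So after switching crawling off, `enqueue` rejects new URLs, while `process_next` keeps returning the remaining entries until the queue is empty.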