Q: What is Evliya Celebi?
A: Evliya Celebi is a web robot that searches for Turkish content. Its
name comes from the famous Turkish traveller.
Q: I don't want it to crawl my site. What can I do?
A: Add the following entry to the robots.txt file at the root of your
site:
User-Agent: EvliyaCelebi
Disallow: /
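If you want to check the entry before publishing it, a minimal sketch
using Python's standard urllib.robotparser module is shown here. The
host name www.example.com.tr and the helper name is_allowed are
placeholders for illustration only; this is not the robot's own code.

```python
from urllib.robotparser import RobotFileParser

# The entry from this FAQ, exactly as it would appear in robots.txt.
ROBOTS_TXT = """\
User-Agent: EvliyaCelebi
Disallow: /
"""

def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt lets user_agent fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)

if __name__ == "__main__":
    page = "http://www.example.com.tr/index.html"  # placeholder URL
    print(is_allowed(ROBOTS_TXT, "EvliyaCelebi", page))
    print(is_allowed(ROBOTS_TXT, "SomeOtherBot", page))
```

With only this entry in the file, the check should report that
EvliyaCelebi is blocked everywhere while other robots are unaffected.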
Q: Is the source available?
A: Yes, versions 0.14 and 0.17 are available as source code. New
versions may be published later.
Q: What is the difference between v0.14 and v0.17?
A: 0.17 uses MySQL as its link database. Unfortunately, I am too lazy
to provide the SQL definitions for creating the tables; look at the
source and work them out yourself. There are also some minor changes
and bug fixes.
Q: Is it a full-featured robot?
A: In fact, no. It was written as a hobby, and its main purpose is
gathering statistics. It may still contain lots of bugs.
Q: What is its speed?
A: It can be run in parallel across several computers. With 10-12
processes running on a Linux machine (Ultra-5 SPARC), it can gather
about 100,000 documents a day.
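To put that figure in perspective, the arithmetic can be sketched as
follows. It assumes the load is spread evenly over the day and over the
processes, which the robot does not guarantee; the function name is
made up for this example.

```python
SECONDS_PER_DAY = 24 * 60 * 60
DOCS_PER_DAY = 100_000  # figure quoted in this FAQ

def per_process_interval(processes, docs_per_day=DOCS_PER_DAY):
    """Average seconds between two fetches made by a single process."""
    return SECONDS_PER_DAY * processes / docs_per_day

# Aggregate rate over all processes and all sites, in documents per second.
overall_rate = DOCS_PER_DAY / SECONDS_PER_DAY

if __name__ == "__main__":
    print(round(overall_rate, 2))           # about 1.16 documents per second
    print(per_process_interval(10))         # about 8.6 seconds per fetch
    print(per_process_interval(12))         # about 10.4 seconds per fetch
```

These are totals across every site being crawled, not the request rate
seen by any one site.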
Q: How does it select the next URL to fetch?
A: It builds a database of available URLs retrieved from hyperlinks.
Then it randomly selects one of the links.
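The idea can be sketched in a few lines of Python. This is only an
illustration under stated assumptions: it keeps the links in an
in-memory set rather than a real link database, it picks uniformly at
random (the actual selection weights are not documented here), and the
names LinkExtractor, harvest_links and pick_next_url are invented for
the example.

```python
import random
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin

class LinkExtractor(HTMLParser):
    """Collects absolute http(s) URLs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links and drop the #fragment part.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                if absolute.startswith(("http://", "https://")):
                    self.links.add(absolute)

def harvest_links(base_url, html):
    """Return the set of URLs linked from one fetched page."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links

def pick_next_url(link_db, rng=random):
    """Choose the next URL uniformly at random from the link database."""
    return rng.choice(sorted(link_db))

if __name__ == "__main__":
    page = '<a href="b.html">b</a> <a href="/c#top">c</a>'
    link_db = harvest_links("http://www.example.com.tr/a/", page)
    print(pick_next_url(link_db))
```

Random selection spreads requests over many hosts instead of walking
one site from top to bottom.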
Q: My page has nothing to do with Turkish content. Why did your spider
come here?
A: It crawls pages under the '.tr' domain or pages that use a Turkish
character encoding. If it found your site, then a page with Turkish
content links to it. Once it detects that your page is not in a Turkish
character encoding, it will not follow links from your site and will
not retrieve your page again.
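A rough sketch of such a check follows. How the robot actually detects
the encoding is not described in this FAQ; the example assumes it can be
read from the charset parameter of the Content-Type header and treats
ISO-8859-9 and windows-1254 as the Turkish encodings. All function names
are hypothetical.

```python
from urllib.parse import urlsplit

# Assumption: these two charsets are what counts as "Turkish encoding".
TURKISH_CHARSETS = {"iso-8859-9", "windows-1254"}

def is_tr_domain(url):
    """True if the URL's host is under the .tr top-level domain."""
    host = urlsplit(url).hostname or ""
    return host.endswith(".tr")

def declared_charset(content_type):
    """Extract the charset from a Content-Type value, or None if absent."""
    for part in content_type.split(";")[1:]:
        name, _, value = part.strip().partition("=")
        if name.lower() == "charset":
            return value.strip("\"' ").lower()
    return None

def should_follow_links(url, content_type):
    """Keep crawling from this page only if it looks like Turkish content."""
    return is_tr_domain(url) or declared_charset(content_type) in TURKISH_CHARSETS

if __name__ == "__main__":
    print(should_follow_links("http://www.example.com.tr/", "text/html"))
    print(should_follow_links("http://example.com/", "text/html; charset=utf-8"))
```

A page that fails this test is fetched once, because a Turkish page
linked to it, and is then dropped from further crawling.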
Q: I changed my robots.txt, but your robot did not notice. Why?
A: It checks robots.txt about every 6 hours, so a change may take that
long to be picked up.
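The delay comes from caching. A minimal sketch of a cache with a
6-hour refresh interval is given here, assuming a caller-supplied fetch
function that returns the robots.txt text for a host; RobotsCache and
its parameters are illustrative names, not the robot's real internals.

```python
import time
from urllib.robotparser import RobotFileParser

RECHECK_SECONDS = 6 * 60 * 60  # the 6-hour interval mentioned in this FAQ

class RobotsCache:
    """Caches parsed robots.txt rules per host and refreshes them when stale."""

    def __init__(self, fetch, clock=time.time):
        self._fetch = fetch    # hypothetical callable: host -> robots.txt text
        self._clock = clock    # injectable so the behaviour can be tested
        self._entries = {}     # host -> (fetched_at, parser)

    def parser_for(self, host):
        now = self._clock()
        entry = self._entries.get(host)
        if entry is None or now - entry[0] >= RECHECK_SECONDS:
            # First visit, or the cached copy is at least 6 hours old.
            parser = RobotFileParser()
            parser.parse(self._fetch(host).splitlines())
            entry = (now, parser)
            self._entries[host] = entry
        return entry[1]

if __name__ == "__main__":
    cache = RobotsCache(lambda host: "User-Agent: EvliyaCelebi\nDisallow: /\n")
    rules = cache.parser_for("www.example.com.tr")
    print(rules.can_fetch("EvliyaCelebi", "http://www.example.com.tr/"))
```

Until the cached copy expires, the robot keeps applying the rules it
read last time, which is why a fresh edit is not seen immediately.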