2e8fe364cb
Signed-off-by: Magnus Enger <magnus@enger.priv.no> Signed-off-by: Chris Cormack <chrisc@catalyst.net.nz>
28 lines
1.1 KiB
Text
28 lines
1.1 KiB
Text
The robots.txt file.
|
|
|
|
Search engines, when looking for sites to show in search results, will first
|
|
look for the file /robots.txt. If this file is found and has lines that apply
|
|
to them they will do as instructed. A very basic robots.txt follow as an
|
|
example:
|
|
|
|
-------------------------------------------
|
|
# go away
|
|
User-agent: *
|
|
Disallow: /
|
|
-------------------------------------------
|
|
|
|
This tells every search engine that cares (User-agent: *) to not index the site
|
|
(Disallow everything past /).
|
|
|
|
Another slightly more intelligent robots.txt file example allows for some bot indexing (good for your site in google, etc), but also stops your Koha from getting thrashing by ignoring URLs that cause heavy search load
|
|
|
|
-------------------------------------------
|
|
# do some indexing, but dont index search URLs
|
|
User-agent: *
|
|
Disallow: /cgi-bin/koha/opac-search.pl
|
|
-------------------------------------------
|
|
|
|
If you have installed Koha to /usr/local/koha3 then this file would be placed
|
|
in the directory /usr/local/koha3/opac/htdocs/. This should prevent search
|
|
engines from browsing every biblio record, and every view of each record, on
|
|
your Koha install periodically.
|