Web Crawlers

I am looking for a newsgroup or forum that discusses best practices in
using methods to influence how web crawlers record a website.

Information will be appreciated,
Stan Hilliard
Stan Hilliard [ Di, 05 Juni 2007 21:21 ] [ ID #1730975 ]

Re: Web Crawlers

Stan Hilliard <usenetreplyUM [at] sampling4plansNOTSPAM.com> wrote in
news:bhdb635dtmeeqsgd37q343l8plsro6kl33 [at] 4ax.com:

> I am looking for a newsgroup or forum that discusses best practices in
> using methods to influence how web crawlers record a website.
>
> Information will be appreciated,
> Stan Hilliard
>

http://www.robotstxt.org/wc/faq.html
Fuzzy Logic [ Di, 05 Juni 2007 22:34 ] [ ID #1730981 ]

Re: Web Crawlers

On Tue, 05 Jun 2007 20:34:58 GMT, Fuzzy Logic
<bob [at] arc.ab.caREMOVETHIS> wrote:

>Stan Hilliard <usenetreplyUM [at] sampling4plansNOTSPAM.com> wrote in
>news:bhdb635dtmeeqsgd37q343l8plsro6kl33 [at] 4ax.com:
>
>> I am looking for a newsgroup or forum that discusses best practices in
>> using methods to influence how web crawlers record a website.
>>
>> Information will be appreciated,
>> Stan Hilliard
>>
>
>http://www.robotstxt.org/wc/faq.html

Thanks. I have placed a robots.txt on my website. But I notice from my
log files that many web crawlers do not even check it. Can some web
crawlers be malicious?

Is there a free tool that can analyze a web site to determine its
vulnerabilities?

Stan Hilliard
Stan Hilliard [ Mi, 13 Juni 2007 15:08 ] [ ID #1737145 ]

Re: Web Crawlers

On Wed, 13 Jun 2007 08:08:00 -0500, Stan Hilliard
<usenetreplyUM [at] sampling4plansNOTSPAM.com> wrote:

>On Tue, 05 Jun 2007 20:34:58 GMT, Fuzzy Logic
><bob [at] arc.ab.caREMOVETHIS> wrote:
>
>>Stan Hilliard <usenetreplyUM [at] sampling4plansNOTSPAM.com> wrote in
>>news:bhdb635dtmeeqsgd37q343l8plsro6kl33 [at] 4ax.com:
>>
>>> I am looking for a newsgroup or forum that discusses best practices in
>>> using methods to influence how web crawlers record a website.
>>>
>>> Information will be appreciated,
>>> Stan Hilliard
>>>
>>
>>http://www.robotstxt.org/wc/faq.html
>
>Thanks. I have placed a robots.txt on my website. But I notice from my
>log files that many web crawlers do not even check it. Can some web
>crawlers be malicious?
>
>Is there a free tool that can analyze a web site to determine its
>vulnerabilities?
>
>Stan Hilliard

My web site is on a remote windows 2000 server running IIS & MS
Frontpage. It is hosted by an internet service provider. I am mostly
interested in whether permissions are properly set, what
vulnerabilities it has, and occurrences or clues of malicious
intrusion.

Stan Hilliard
Stan Hilliard [ Mi, 13 Juni 2007 15:31 ] [ ID #1737146 ]
Miscellaneous » comp.security.misc » Web Crawlers

Vorheriges Thema: digitally sign office and pdf's???
Nächstes Thema: HPSBUX02219 SSRT061273 rev.1 - HP-UX Running BIND, Remote Denial of Service (DoS)