Capturing Sogou web spider entries from an Apache Web log file

Q

How can I capture the Sogou web spider entries from an Apache Web log file?

Here are some Web log file entries:

✍: Guest

A

Sogou web spider is a Web crawler from Sogou.com that obtains content for the Sogou search engine. It uses the following user agent string:

Sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm#07)

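The regular expression to capture Sogou web spider entries from an Apache Web log file is written with the multiline modifier "m" specified, so that "^" and "$" match at the start and end of each log entry rather than only at the start and end of the whole file.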
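As a minimal sketch of this approach in Python, the pattern below matches any whole line that contains the "Sogou web spider/" token from the user agent string. The pattern itself, the function name capture_sogou_entries, and the sample log entries are assumptions made for illustration; they are not taken from the original answer. Python's re.MULTILINE flag plays the role of the "m" modifier.

```python
import re

# Assumed pattern (not the original author's): match a whole line that
# contains the Sogou user agent token. re.MULTILINE is the "m" modifier,
# so ^ and $ anchor at the start and end of each line.
SOGOU_LINE = re.compile(r"^.*Sogou web spider/.*$", re.MULTILINE)

# Made-up sample entries in Apache combined log format, for illustration only.
SAMPLE_LOG = "\n".join([
    '192.0.2.10 - - [04/Feb/2013:10:00:00 +0000] "GET /index.html HTTP/1.1" '
    '200 1024 "-" "Sogou web spider/4.0'
    '(+http://www.sogou.com/docs/help/webmasters.htm#07)"',
    '192.0.2.11 - - [04/Feb/2013:10:00:05 +0000] "GET /about.html HTTP/1.1" '
    '200 2048 "-" "Mozilla/5.0 (Windows NT 6.1)"',
    '192.0.2.12 - - [04/Feb/2013:10:00:09 +0000] "GET /robots.txt HTTP/1.1" '
    '200 64 "-" "Sogou web spider/4.0'
    '(+http://www.sogou.com/docs/help/webmasters.htm#07)"',
])


def capture_sogou_entries(log_text):
    """Return every log line that mentions the Sogou web spider user agent."""
    return SOGOU_LINE.findall(log_text)


if __name__ == "__main__":
    for entry in capture_sogou_entries(SAMPLE_LOG):
        print(entry)
```

Because "." does not match a newline by default, each match stays within a single log entry, and lines from other user agents are skipped.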


2013-02-04