Capturing Soso Spider entries from Apache Web log file

Q

How to capture the Soso Spider entries from Web log file?

Here are some Web log file entries:

✍: Guest

A

Soso Spider is a Web crawler from soso.com that obtains content for Soso search engine. Soso spider uses the following user agent string:

Sosospider+(+http://help.soso.com/webspider.htm)

The regular expression to capture Jike Spider entries from Apache Web log file can be written as: with the multiple lines modifier "m" specified:

Click the button to test this regular expression here online:

2013-02-04, 6310👍, 0💬