This is a copyright violation detector running on Wikimedia Cloud Services.
It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.
If using Toolforge, you should clone the repository to ~/www/python/src, or otherwise symlink it to that directory. A virtualenv should be created at ~/www/python/venv.
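The directory layout described above can be checked before starting the web service. The following is a minimal sketch, not part of the tool itself: `check_layout` is a hypothetical helper, and it assumes only the two paths named above (relative to the home directory).

```python
import os


def check_layout(home):
    """Return a list of problems with the Toolforge directory layout under `home`.

    Hypothetical helper for illustration; the paths are the ones named above.
    """
    problems = []
    src = os.path.join(home, "www", "python", "src")
    venv = os.path.join(home, "www", "python", "venv")

    # src may be a real clone or a symlink to one; isdir() follows symlinks.
    if not os.path.isdir(src):
        problems.append("missing source directory: " + src)
    if not os.path.isdir(venv):
        problems.append("missing virtualenv: " + venv)
    return problems


if __name__ == "__main__":
    for problem in check_layout(os.path.expanduser("~")):
        print(problem)
```

An empty result means both directories are in place; it does not verify that the clone is up to date or that the virtualenv has the dependencies installed.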
Install all dependencies listed above.
Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.
Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:
    _copyviosSQL:
        host: <hostname of database server>
        db: <name of database>
If additional arguments are needed by oursql.connect(), like usernames or passwords, they should be added to the _copyviosSQL section.
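To illustrate how extra keys in the section might reach the database driver, here is a minimal sketch. It assumes that every key under _copyviosSQL is forwarded unchanged as a keyword argument to oursql.connect(); the tool's own code may add or override arguments. The parsed config is represented by a plain dict, `build_connect_kwargs` is a hypothetical helper, and all values shown are placeholders.

```python
def build_connect_kwargs(wiki_section):
    """Return keyword arguments for a database connection.

    Hypothetical helper: assumes each key under `_copyviosSQL` maps
    directly onto a keyword argument of the connect call.
    """
    # Copy so the caller's config dict is not mutated.
    return dict(wiki_section.get("_copyviosSQL", {}))


# Stand-in for the parsed `wiki` section of .earwigbot/config.yml.
wiki = {
    "_copyviosSQL": {
        "host": "db.example.org",   # placeholder hostname
        "db": "copyvios_cache",     # placeholder database name
        "user": "example_user",     # extra argument (placeholder)
        "passwd": "example_pass",   # extra argument (placeholder)
    }
}

kwargs = build_connect_kwargs(wiki)
print(sorted(kwargs))
# conn = oursql.connect(**kwargs)  # left commented out: needs oursql and a live server
```

Under this assumption, any option that oursql.connect() accepts can be supplied simply by adding a key of the same name to the section.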
Run ./build.py to minify JS and CSS files.
Start the web server (on Toolforge, webservice uwsgi-python start).