Ben Kurtovic 8321a07ba9 | há 5 anos | |
---|---|---|
copyvios | há 5 anos | |
logs | há 10 anos | |
static | há 7 anos | |
templates | há 5 anos | |
.gitignore | há 10 anos | |
LICENSE | há 8 anos | |
README.md | há 5 anos | |
app.py | há 5 anos | |
build.py | há 10 anos | |
schema.sql | há 5 anos |
This is a copyright violation detector running on Wikimedia Labs.
It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.
If using Tool Labs, you should clone the repository to ~/www/python/src
, or
otherwise symlink it to that directory. A
virtualenv should be created at
~/www/python/venv
.
Install all dependencies listed above.
Create an SQL database with the cache
and cache_data
tables defined by
earwigbot-plugins.
Create an earwigbot instance in .earwigbot
(run earwigbot .earwigbot
). In
.earwigbot/config.yml
, fill out the connection info for the database by
adding the following to the wiki
section:
_copyviosSQL:
host: <hostname of database server>
db: <name of database>
If additional arguments are needed by oursql.connect()
, like usernames or
passwords, they should be added to the _copyviosSQL
section.
Run ./build.py
to minify JS and CSS files.
Start the web server (on Tool Labs, webservice2 uwsgi-python start
).