A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Ben Kurtovic 1d91b9171b Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
copyvios Even better error handling 3 years ago
logs Always have a log dir. 10 years ago
scripts Add log analysis script and improve API docs 4 years ago
static Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
templates Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
.gitignore Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
LICENSE Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
README.md Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
app.py Update URLs 4 years ago
build.py Clean up CSS/JS; new minify pipeline; rework footer 3 years ago
schema.sql More additions for sqlite support 5 years ago

README.md

This is a copyright violation detector running on Wikimedia Cloud Services.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.

Dependencies

Running

  • If using Toolforge, clone the repository to ~/www/python/src, or symlink it to that directory. Create a virtualenv at ~/www/python/venv.

  • Install all dependencies listed above.

  • Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      _copyviosSQL:
          host: <hostname of database server>
          db:   <name of database>
    

    If additional arguments are needed by oursql.connect(), like usernames or passwords, they should be added to the _copyviosSQL section.

  • Run ./build.py to minify JS and CSS files.

  • Start the web server (on Toolforge, webservice uwsgi-python start).
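
The database step above says that any extra arguments for oursql.connect() go in the _copyviosSQL section alongside host and db. A minimal sketch of that idea follows, assuming the section is loaded as a plain mapping and passed through as keyword arguments. The helper name connect_kwargs and all values are hypothetical placeholders, not taken from the project; user and passwd are shown as examples of extra oursql.connect() arguments.

```python
# Hypothetical helper for illustration only; it is not part of this repository.
def connect_kwargs(wiki_config):
    """Return keyword arguments for a database connect call.

    Every key under ``_copyviosSQL`` is passed through unchanged, so extra
    entries such as ``user`` or ``passwd`` reach the driver as-is.
    """
    section = wiki_config.get("_copyviosSQL")
    if not section:
        raise KeyError("wiki section has no _copyviosSQL entry")
    # host and db are the two keys the README asks for explicitly.
    missing = [key for key in ("host", "db") if key not in section]
    if missing:
        raise KeyError("missing _copyviosSQL keys: " + ", ".join(missing))
    return dict(section)


# Example wiki section as it might look once config.yml is loaded
# (all values are placeholders).
wiki = {
    "_copyviosSQL": {
        "host": "db.example.org",
        "db": "copyvios_cache",
        "user": "example_user",
        "passwd": "example_password",
    }
}

kwargs = connect_kwargs(wiki)
# With oursql installed, the connection could then be opened as:
#   conn = oursql.connect(**kwargs)
```

Passing the section through unchanged means a new connection option only needs a config edit, not a code change. The trade-off is that a misspelled key is reported by the database driver rather than by the configuration loader.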