A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Ben Kurtovic f2c1d5b853 Make cache.langs, cache.projects sorted lists. 10 years ago
copyvios Make cache.langs, cache.projects sorted lists. 10 years ago
logs Always have a log dir. 10 years ago
static Fix bug in Javascript. 10 years ago
templates Ugly solution for escaping URLs in permlinks. 10 years ago
.gitignore Fix gitignore for logs. 10 years ago
.lighttpd.conf Begin conversion to Flask; updates. 10 years ago
LICENSE Updates. 10 years ago
README.md Link to some technical details. 10 years ago
app.fcgi Make cache.langs, cache.projects sorted lists. 10 years ago
build.py Begin conversion to Flask; updates. 10 years ago

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.

Dependencies

Running

  • Install all dependencies listed above. You might want to use a virtualenv.

  • Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      _copyviosSQL:
          host: <hostname of database server>
          db:   <name of database>
    

    If additional arguments are needed by oursql.connect(), like usernames or passwords, they should be added to the _copyviosSQL section.

  • Copy .lighttpd.conf to the relevant location (on Tool Labs, this is in the root of the project’s home directory) and adjust its contents as necessary.

  • Run ./build.py to minify JS and CSS files.

  • Adjust the hashbang in app.fcgi to point to the correct Python interpreter or virtual environment.

  • Start lighttpd (on Tool Labs, webservice start).
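
As a rough illustration of the database step above, the sketch below shows one way the `_copyviosSQL` section could be turned into keyword arguments for `oursql.connect()`. It is not the tool's actual code: `wiki_config`, `build_connect_kwargs`, the placeholder host and database names, and the extra `user` key are all assumptions made for the example. The config is written as a plain dict so the snippet runs without a YAML parser or oursql installed.

```python
# Hypothetical stand-in for the parsed "wiki" section of .earwigbot/config.yml.
wiki_config = {
    "_copyviosSQL": {
        "host": "db.example.org",  # placeholder hostname
        "db": "copyvios_cache",    # placeholder database name
        "user": "tooluser",        # assumed extra argument, passed through as-is
    }
}


def build_connect_kwargs(wiki_section):
    """Return a copy of the _copyviosSQL section for use as connect() kwargs."""
    if "_copyviosSQL" not in wiki_section:
        raise KeyError("the wiki section has no _copyviosSQL entry")
    section = wiki_section["_copyviosSQL"]
    # The README names host and db explicitly, so treat them as required here.
    missing = [key for key in ("host", "db") if key not in section]
    if missing:
        raise ValueError("missing keys in _copyviosSQL: " + ", ".join(missing))
    # Copy so that later changes do not mutate the loaded configuration.
    return dict(section)


kwargs = build_connect_kwargs(wiki_config)
# With oursql installed, the connection would then be opened roughly as:
#     conn = oursql.connect(**kwargs)
```

Passing the whole section through unchanged matches the README's note that any additional `oursql.connect()` arguments belong in `_copyviosSQL`; the check for `host` and `db` is an extra safeguard added for the example.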