A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Vous ne pouvez pas sélectionner plus de 25 sujets Les noms de sujets doivent commencer par une lettre ou un nombre, peuvent contenir des tirets ('-') et peuvent comporter jusqu'à 35 caractères.
 
 
 
 
 
Ben Kurtovic b2894c6c0a Report possible misses as well as known skips. il y a 10 ans
copyvios Truncate URLs above 1024 chars. il y a 10 ans
logs Always have a log dir. il y a 10 ans
static Adjust indent size; fix rendering of strings. il y a 10 ans
templates Report possible misses as well as known skips. il y a 10 ans
.gitignore Fix gitignore for logs. il y a 10 ans
.lighttpd.conf Begin conversion to Flask; updates. il y a 10 ans
LICENSE Updates. il y a 10 ans
README.md Link to some technical details. il y a 10 ans
app.fcgi Update sites before doing check. il y a 10 ans
build.py Begin conversion to Flask; updates. il y a 10 ans

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.

Dependencies

Running

  • Install all dependencies listed above. You might want to use a virtualenv.

  • Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      _copyviosSQL:
          host: <hostname of database server>
          db:   <name of database>
    

    If additional arguments are needed by oursql.connect(), like usernames or passwords, they should be added to the _copyviosSQL section.

  • Copy .lighttpd.conf to the relevant location (on Tool Labs, this is in the root of the project’s home directory) and adjust its contents as necessary.

  • Run ./build.py to minify JS and CSS files.

  • Adjust the hashbang in app.fcgi to point to the correct Python interpreter or virtual environment.

  • Start lighttpd (on Tool Labs, webservice start).