A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Ben Kurtovic 8a2f1f022e Improve walk_json(). 10 years ago
copyvios Rearrange functions in api.py. 10 years ago
logs Always have a log dir. 10 years ago
static Adjust indent size; fix rendering of strings. 10 years ago
templates Improve walk_json(). 10 years ago
.gitignore Fix gitignore for logs. 10 years ago
.lighttpd.conf Begin conversion to Flask; updates. 10 years ago
LICENSE Updates. 10 years ago
README.md Fix bug in README. 10 years ago
app.fcgi @catch_errors for api routes. 10 years ago
build.py Begin conversion to Flask; updates. 10 years ago
schema.sql Adjust db name. 11 years ago

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL.

Dependencies

Running

  • Install all dependencies listed above. You might want to use a virtualenv.

  • Create the SQL database defined in schema.sql. Also create the cache and cache_data tables defined by earwigbot-plugins; this can be in the same or a different database.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database(s) above by adding the following to the wiki section:

      _copyviosSQL:
          globals:
              host: <hostname of database defined in schema.sql>
              db:   <name of database>
          cache:
              host: <hostname of database containing cache and cache_data tables>
              db:   <name of database>
    

    If oursql.connect() needs additional arguments, such as a username or password, add them to both the globals and cache sections.

  • Copy .lighttpd.conf to the relevant location (on Tool Labs, this is in the root of the project’s home directory) and adjust its contents as necessary.

  • Run ./build.py to minify JS and CSS files.

  • Adjust the hashbang in app.fcgi to point to the correct Python interpreter or virtual environment.

  • Start lighttpd (on Tool Labs, webservice start).
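As a rough illustration of the `_copyviosSQL` step above, the sketch below shows how the `globals` and `cache` sections could be turned into keyword arguments for a database connection. It assumes each section is passed through more or less unchanged to `oursql.connect()`, which is what the note about additional arguments suggests but is not confirmed here. The helper name `connect_kwargs`, the hostnames, the database names, and the `user`/`passwd` values are all hypothetical placeholders.

```python
# Minimal sketch, not the tool's actual code: validate one section of
# _copyviosSQL and return it as keyword arguments for a connect() call.

REQUIRED_KEYS = ("host", "db")


def connect_kwargs(section):
    """Return a copy of `section` after checking that host and db are present."""
    missing = [key for key in REQUIRED_KEYS if key not in section]
    if missing:
        raise ValueError("missing required keys: " + ", ".join(missing))
    # Extra keys (for example user or passwd) are kept as-is, on the
    # assumption that they are forwarded to the database driver.
    return dict(section)


# Hypothetical values mirroring the structure of the YAML snippet above.
copyvios_sql = {
    "globals": {
        "host": "db.example.org",
        "db": "copyvios_globals",
        "user": "example_user",
        "passwd": "example_password",
    },
    "cache": {
        "host": "db.example.org",
        "db": "copyvios_cache",
    },
}

globals_args = connect_kwargs(copyvios_sql["globals"])
cache_args = connect_kwargs(copyvios_sql["cache"])

# With oursql installed, the connection itself would then look like:
#     conn = oursql.connect(**globals_args)
```

Checking for `host` and `db` up front means a half-filled config fails with a clear message instead of a driver error at request time.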