A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
Ben Kurtovic 69819ed3f8 @catch_errors for api routes. 10 年之前
copyvios Rearrange functions in api.py. 10 年之前
logs Always have a log dir. 10 年之前
static JSON walker for jsonfm mode. 10 年之前
templates JSON walker for jsonfm mode. 10 年之前
.gitignore Fix gitignore for logs. 10 年之前
.lighttpd.conf Begin conversion to Flask; updates. 10 年之前
LICENSE Updates. 10 年之前
README.md Fix bug in README. 10 年之前
app.fcgi @catch_errors for api routes. 10 年之前
build.py Begin conversion to Flask; updates. 10 年之前
schema.sql Adjust db name. 11 年之前

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL.

Dependencies

Running

  • Install all dependencies listed above. You might want to use a virtualenv.

  • Create the SQL database defined in schema.sql. Also create the cache and cache_data tables defined by earwigbot-plugins; this can be in the same or a different database.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database(s) above by adding the following to the wiki section:

      _copyviosSQL:
          globals:
              host: <hostname of database defined in schema.sql>
              db:   <name of database>
          cache:
              host: <hostname of database containing cache and cache_data tables>
              db:   <name of database>
    

    If additional arguments are needed by oursql.connect(), like usernames or passwords, they should be added to the globals and cache sections.

  • Copy .lighttpd.conf to the relevant location (on Tool Labs, this is in the root of the project’s home directory) and adjust its contents as necessary.

  • Run ./build.py to minify JS and CSS files.

  • Adjust the hashbang in app.fcgi to point to the correct Python interpreter or virtual environment.

  • Start lighttpd (on Tool Labs, webservice start).