A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Non puoi selezionare più di 25 argomenti Gli argomenti devono iniziare con una lettera o un numero, possono includere trattini ('-') e possono essere lunghi fino a 35 caratteri.
 
 
 
 
 
Ben Kurtovic 5839f74850 Update for new uWSGI stuff. 9 anni fa
copyvios Make cache.langs, cache.projects sorted lists. 10 anni fa
logs Always have a log dir. 10 anni fa
static Fix bug in Javascript. 10 anni fa
templates Update copyright year. 10 anni fa
.gitignore Fix gitignore for logs. 10 anni fa
LICENSE Update copyright year. 10 anni fa
README.md Update for new uWSGI stuff. 9 anni fa
app.py Update for new uWSGI stuff. 9 anni fa
build.py Begin conversion to Flask; updates. 10 anni fa

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.

Dependencies

Running

  • If using Tool Labs, you should clone the repository to ~/www/python/src, or otherwise symlink it to that directory. A virtualenv should be created at ~/www/python/venv.

  • Install all dependencies listed above.

  • Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      _copyviosSQL:
          host: <hostname of database server>
          db:   <name of database>
    

    If additional arguments are needed by oursql.connect(), like usernames or passwords, they should be added to the _copyviosSQL section.

  • Run ./build.py to minify JS and CSS files.

  • Start the web server (on Tool Labs, webservice2 uwsgi-python start).