A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
選択できるのは25トピックまでです。 トピックは、先頭が英数字で、英数字とダッシュ('-')を使用した35文字以内のものにしてください。
 
 
 
 
 
Ben Kurtovic c9066e6259 More Python 3 conversion and heavy refactoring 1ヶ月前
logs Always have a log dir. 10年前
scripts Initial Python 3 conversion 2ヶ月前
src/copyvios More Python 3 conversion and heavy refactoring 1ヶ月前
static More Python 3 conversion and heavy refactoring 1ヶ月前
templates More Python 3 conversion and heavy refactoring 1ヶ月前
.gitignore Initial Python 3 conversion 2ヶ月前
.pre-commit-config.yaml More Python 3 conversion and heavy refactoring 1ヶ月前
LICENSE Initial Python 3 conversion 2ヶ月前
Makefile More Python 3 conversion and heavy refactoring 1ヶ月前
README.md More Python 3 conversion and heavy refactoring 1ヶ月前
app.py More Python 3 conversion and heavy refactoring 1ヶ月前
pyproject.toml More Python 3 conversion and heavy refactoring 1ヶ月前
schema.sql More additions for sqlite support 5年前

README.md

This is a copyright violation detector web tool for Wikipedia articles running on Wikimedia Cloud Services at copyvios.toolforge.org.

It can search the web for content similar to a given article, and graphically compare an article to specific URLs. Some technical details are expanded upon in a blog post, though much of it is outdated.

Installation

  • If using Toolforge, clone the repository to ~/www/python/src, or otherwise symlink it to that directory.

  • Create a virtual environment and install the dependencies. On Toolforge, this should be in ~/www/python/venv, otherwise it can be in a subdirectory of the git project named venv:

    python3 -m venv venv
    . venv/bin/activate
    pip install -e .
    
  • If you intend to modify CSS or JS, install the frontend dependencies:

    npm install -g uglify-js cssnano postcss postcss-cli
    
  • Create an SQL database with the tables defined by schema.sql.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      copyvios:
          engine: mysql
          host: <hostname of database server>
          db: <name of database>
    

Running

  • Run make to minify JS and CSS files after making any frontend changes.

  • Start your WSGI server pointing to app:app. For production, uWSGI or Gunicorn are likely good options. For development, use flask run.