Ben Kurtovic
0692227496
Cleanup/tweaks, mainly to rendering.
pirms 9 gadiem
Ben Kurtovic
6d940b5c92
ast.literal_eval -> json.loads
pirms 9 gadiem
Ben Kurtovic
83f5588354
Disable turnitin by default for now, until we confirm it works as expected.
pirms 9 gadiem
Frances Hocutt
4e994f1302
Refactor turnitin.py, incorporate diff link/timestamp
* Add a wiki timestamp parser to copyvios/misc.py
* Refactor copyvios/turnitin.py for more sensible structure
* Update templates/index.mako to incorporate diff link/timestamp and
make it clearer that Turnitin is revision-based checking
pirms 9 gadiem
Frances Hocutt
1ffa87da0b
Improve turnitin.py docstrings, fix bugs
pirms 9 gadiem
Frances Hocutt
bf0aa22fa9
[WIP] improve docstrings and naming, mark TODO
pirms 9 gadiem
Frances Hocutt
efe2300250
[WIP] Basic working integration of turnitin
Add a checkbox to allow searching the EranBot/plagiabot database for
Turnitin results, and display them in a similar form to the on-wiki
EranBot reports if they exist.
Add a new module (copyvios/turnitin.py) to handle fetching and parsing
the EranBot results.
Bug: https://phabricator.wikimedia.org/T110144
TODO: tweak display HTML/CSS; refactor/clean up turnitin.py;
improve dev set-up so it doesn't always default to testwiki
and can test without hardcoding page title
pirms 9 gadiem
Ben Kurtovic
ad033a6e2b
Pass excluded URL data to the API.
pirms 9 gadiem
Ben Kurtovic
321e6e3352
Support new 'excluded' column in database.
pirms 9 gadiem
Ben Kurtovic
4092cbc0f6
Keep all workers' backgrounds in sync, and ensure they update at 00:00 UTC.
pirms 9 gadiem
Ben Kurtovic
b325ef84cd
Support general/update notices.
pirms 9 gadiem
Ben Kurtovic
1327f57404
Fix backgrounds when the image has been moved.
pirms 9 gadiem
Ben Kurtovic
f2c1d5b853
Make cache.langs, cache.projects sorted lists.
pirms 10 gadiem
Ben Kurtovic
0f4ecfe80a
Auto-bypass cache when cache_possible_miss is True.
pirms 10 gadiem
Ben Kurtovic
81552f28d3
Bugfix; missed column.
pirms 10 gadiem
Ben Kurtovic
10e75c7b7a
Cache 'possible_miss' result value.
pirms 10 gadiem
Ben Kurtovic
331b05baae
Truncate URLs above 1024 chars.
pirms 10 gadiem
Ben Kurtovic
a07d12699a
Finally fix #3 ; speed up highlighter with a deque.
pirms 10 gadiem
Ben Kurtovic
f4ff4099be
Fix old reference to query.all_projects.
pirms 10 gadiem
Ben Kurtovic
a1d8b79265
Fix recursion bug.
pirms 10 gadiem
Ben Kurtovic
6eed95c761
Use a global cache instead of the request-specific one.
pirms 10 gadiem
Ben Kurtovic
ddcf2a8963
Store what was previously in SQL tables in flask.g.
pirms 10 gadiem
Ben Kurtovic
9b07a5e9dc
OrderedDict requires extra parens.
pirms 10 gadiem
Ben Kurtovic
340ba07c9d
Generate API results as ordered dicts.
pirms 10 gadiem
Ben Kurtovic
6d87ad41ed
Rearrange functions in api.py.
pirms 10 gadiem
Ben Kurtovic
2fad6b0b60
Description of API response data.
pirms 10 gadiem
Ben Kurtovic
89feddba98
Some minor tweaks, plus documentation for API parameters.
pirms 10 gadiem
Ben Kurtovic
6639338fa8
First version of new API (for #7 )
pirms 10 gadiem
Ben Kurtovic
6f574e37a2
Pluralize cache times correctly.
pirms 10 gadiem
Ben Kurtovic
db7a4f2ec4
Clean up query data handling.
pirms 10 gadiem
Ben Kurtovic
b3e1862726
Catch and display search errors more usefully.
pirms 10 gadiem
Ben Kurtovic
b7bafb29cf
Implement noskip.
pirms 10 gadiem
Ben Kurtovic
73315e3370
FML.
pirms 10 gadiem
Ben Kurtovic
5242d31a93
Whoops.
pirms 10 gadiem
Ben Kurtovic
0731eb089d
Cleanup; fix confidence value comparison; fix.
pirms 10 gadiem
Ben Kurtovic
7cd9a53209
Fix oursql coercing binary data to unicode.
pirms 10 gadiem
Ben Kurtovic
b2fba1fa56
Implement new caching system and result structure.
pirms 10 gadiem
Ben Kurtovic
c6b9c9ebe6
Really fix checkbox blank value treatment.
pirms 10 gadiem
Ben Kurtovic
6cd194f971
Catch empty method error; fix blank values in query.
pirms 10 gadiem
Ben Kurtovic
b90e5e7279
Fix logic.
pirms 10 gadiem
Ben Kurtovic
c165e97fb2
Handle empty URLs from cached results correctly.
pirms 10 gadiem
Ben Kurtovic
710447a6bf
Slightly modified interface; give more options ( closes #17 ).
pirms 10 gadiem
Ben Kurtovic
8f6a02a8f3
If no scheme is given for the URL, add http://
pirms 10 gadiem
Ben Kurtovic
8d80eccad0
T_POSSIBLE back down to 0.4.
pirms 10 gadiem
Ben Kurtovic
490022910f
Update T_POSSIBLE, T_SUSPECT.
pirms 10 gadiem
Ben Kurtovic
9e13ac1c04
Increase log message verbosity; fix import.
pirms 10 gadiem
Ben Kurtovic
ce96f0f8a6
Use a yellow 'possible' box for confidence between 0.3 and 0.6 ( closes #5 ).
pirms 10 gadiem
Ben Kurtovic
75b627cfa5
Refactor out page.copyvio_compare() and surrounding code.
pirms 10 gadiem
Ben Kurtovic
cf66804fe5
Give a different error message for timeouts. Cheating a bit.
pirms 10 gadiem
Ben Kurtovic
7d426a4eb0
Support following redirects ( closes #6 ).
pirms 10 gadiem