Frances Hocutt
4e994f1302
Refactor turnitin.py, incorporate diff link/timestamp
* Add a wiki timestamp parser to copyvios/misc.py
* Refactor copyvios/turnitin.py for more sensible structure
* Update templates/index.mako to incorporate diff link/timestamp and
make it clearer that Turnitin is revision-based checking
9 years ago
Frances Hocutt
1ffa87da0b
Improve turnitin.py docstrings, fix bugs
9 years ago
Frances Hocutt
bf0aa22fa9
[WIP] improve docstrings and naming, mark TODO
9 years ago
Frances Hocutt
efe2300250
[WIP] Basic working integration of turnitin
Add a checkbox to allow searching the EranBot/plagiabot database for
Turnitin results, and display them in a similar form to the on-wiki
EranBot reports if they exist.
Add a new module (copyvios/turnitin.py) to handle fetching and parsing
the EranBot results.
Bug: https://phabricator.wikimedia.org/T110144
TODO: tweak display HTML/CSS; refactor/clean up turnitin.py;
improve dev set-up so it doesn't always default to testwiki
and can test without hardcoding page title
9 years ago
Ben Kurtovic
ad033a6e2b
Pass excluded URL data to the API.
9 years ago
Ben Kurtovic
321e6e3352
Support new 'excluded' column in database.
9 years ago
Ben Kurtovic
4092cbc0f6
Keep all workers' backgrounds in sync, and ensure they update at 00:00 UTC.
9 years ago
Ben Kurtovic
b325ef84cd
Support general/update notices.
9 years ago
Ben Kurtovic
1327f57404
Fix backgrounds when the image has been moved.
9 years ago
Ben Kurtovic
f2c1d5b853
Make cache.langs, cache.projects sorted lists.
10 years ago
Ben Kurtovic
0f4ecfe80a
Auto-bypass cache when cache_possible_miss is True.
10 years ago
Ben Kurtovic
81552f28d3
Bugfix; missed column.
10 years ago
Ben Kurtovic
10e75c7b7a
Cache 'possible_miss' result value.
10 years ago
Ben Kurtovic
331b05baae
Truncate URLs above 1024 chars.
10 years ago
Ben Kurtovic
a07d12699a
Finally fix #3 ; speed up highlighter with a deque.
10 years ago
Ben Kurtovic
f4ff4099be
Fix old reference to query.all_projects.
10 years ago
Ben Kurtovic
a1d8b79265
Fix recursion bug.
10 years ago
Ben Kurtovic
6eed95c761
Use a global cache instead of the request-specific one.
10 years ago
Ben Kurtovic
ddcf2a8963
Store what was previously in SQL tables in flask.g.
10 years ago
Ben Kurtovic
9b07a5e9dc
OrderedDict requires extra parens.
10 years ago
Ben Kurtovic
340ba07c9d
Generate API results as ordered dicts.
10 years ago
Ben Kurtovic
6d87ad41ed
Rearrange functions in api.py.
10 years ago
Ben Kurtovic
2fad6b0b60
Description of API response data.
10 years ago
Ben Kurtovic
89feddba98
Some minor tweaks, plus documentation for API parameters.
10 years ago
Ben Kurtovic
6639338fa8
First version of new API (for #7 )
10 years ago
Ben Kurtovic
6f574e37a2
Pluralize cache times correctly.
10 years ago
Ben Kurtovic
db7a4f2ec4
Clean up query data handling.
10 years ago
Ben Kurtovic
b3e1862726
Catch and display search errors more usefully.
10 years ago
Ben Kurtovic
b7bafb29cf
Implement noskip.
10 years ago
Ben Kurtovic
73315e3370
FML.
10 years ago
Ben Kurtovic
5242d31a93
Whoops.
10 years ago
Ben Kurtovic
0731eb089d
Cleanup; fix confidence value comparison; fix.
10 years ago
Ben Kurtovic
7cd9a53209
Fix oursql coercing binary data to unicode.
10 years ago
Ben Kurtovic
b2fba1fa56
Implement new caching system and result structure.
10 years ago
Ben Kurtovic
c6b9c9ebe6
Really fix checkbox blank value treatment.
10 years ago
Ben Kurtovic
6cd194f971
Catch empty method error; fix blank values in query.
10 years ago
Ben Kurtovic
b90e5e7279
Fix logic.
10 years ago
Ben Kurtovic
c165e97fb2
Handle empty URLs from cached results correctly.
10 years ago
Ben Kurtovic
710447a6bf
Slightly modified interface; give more options ( closes #17 ).
10 years ago
Ben Kurtovic
8f6a02a8f3
If no scheme is given for the URL, add http://
10 years ago
Ben Kurtovic
8d80eccad0
T_POSSIBLE back down to 0.4.
10 years ago
Ben Kurtovic
490022910f
Update T_POSSIBLE, T_SUSPECT.
10 years ago
Ben Kurtovic
9e13ac1c04
Increase log message verbosity; fix import.
10 years ago
Ben Kurtovic
ce96f0f8a6
Use a yellow 'possible' box for confidence between 0.3 and 0.6 ( closes #5 ).
10 years ago
Ben Kurtovic
75b627cfa5
Refactor out page.copyvio_compare() and surrounding code.
10 years ago
Ben Kurtovic
cf66804fe5
Give a different error message for timeouts. Cheating a bit.
10 years ago
Ben Kurtovic
7d426a4eb0
Support following redirects ( closes #6 ).
10 years ago
Ben Kurtovic
a08bd49a1d
Support searching by revision ID ( closes #8 ).
10 years ago
Ben Kurtovic
ca09caa15d
Give an error when no text could be found inside a URL.
10 years ago
Ben Kurtovic
9a25f97831
Connection pooling; cleanup.
10 years ago