Ben Kurtovic
b134ef67ba
Even better error handling
3 lat temu
Ben Kurtovic
c829590bc9
Bugfix
3 lat temu
Ben Kurtovic
ad87596e37
Better error handling for revision IDs
3 lat temu
Ben Kurtovic
eacb294735
Update URLs
4 lat temu
Ben Kurtovic
4cebe36eee
Validate site info if not in sites DB
5 lat temu
Ben Kurtovic
96ac14ec38
Use rvslots for fetching revision content
5 lat temu
Ben Kurtovic
c73179b28c
Update highlighter for new MarkovChain implementation
5 lat temu
Ben Kurtovic
01cd6dc763
Make caching advisory and reduce db lock duration
5 lat temu
Ben Kurtovic
8321a07ba9
More additions for sqlite support
5 lat temu
Ben Kurtovic
ca58aac413
Add sqlite3 backend to replace mysql
5 lat temu
Ben Kurtovic
29861c1118
Revert previous; literal_eval is necessary here. Add error handling.
6 lat temu
Ben Kurtovic
a63e4e2236
Safer parsing for Turnitin API result.
6 lat temu
Ben Kurtovic
79866da842
Strip whitespace.
7 lat temu
Ben Kurtovic
088dfe6197
Again, fix link and cleanup.
7 lat temu
Ben Kurtovic
21aad51375
Fix unnecessary extra work.
7 lat temu
Ben Kurtovic
63f1ae98c4
Show warning when a page has an attribution template.
7 lat temu
Ben Kurtovic
a2df4bba0a
Tweak parameters.
8 lat temu
Ben Kurtovic
1076959f71
Bugfix in highlight_delta() call; clarify docs.
8 lat temu
Ben Kurtovic
2a81217de8
Add support for detailed text comparison in API (T132949)
8 lat temu
Ben Kurtovic
8be5c63bc1
Turnitin bugfix.
8 lat temu
Ben Kurtovic
137c33bfcb
Fix _format_date() for caches older than a day.
8 lat temu
Ben Kurtovic
e6e83ed73c
Revert "ast.literal_eval -> json.loads"
This reverts commit 6d940b5c92
.
8 lat temu
Ben Kurtovic
0692227496
Cleanup/tweaks, mainly to rendering.
8 lat temu
Ben Kurtovic
6d940b5c92
ast.literal_eval -> json.loads
8 lat temu
Ben Kurtovic
83f5588354
Disable turnitin by default for now, until we confirm it works as expected.
8 lat temu
Frances Hocutt
4e994f1302
Refactor turnitin.py, incorporate diff link/timestamp
* Add a wiki timestamp parser to copyvios/misc.py
* Refactor copyvios/turnitin.py for more sensible structure
* Update templates/index.mako to incorporate diff link/timestamp and
make it clearer that Turnitin is revision-based checking
9 lat temu
Frances Hocutt
1ffa87da0b
Improve turnitin.py docstrings, fix bugs
9 lat temu
Frances Hocutt
bf0aa22fa9
[WIP] improve docstrings and naming, mark TODO
9 lat temu
Frances Hocutt
efe2300250
[WIP] Basic working integration of turnitin
Add a checkbox to allow searching the EranBot/plagiabot database for
Turnitin results, and display them in a similar form to the on-wiki
EranBot reports if they exist.
Add a new module (copyvios/turnitin.py) to handle fetching and parsing
the EranBot results.
Bug: https://phabricator.wikimedia.org/T110144
TODO: tweak display HTML/CSS; refactor/clean up turnitin.py;
improve dev set-up so it doesn't always default to testwiki
and can test without hardcoding page title
9 lat temu
Ben Kurtovic
ad033a6e2b
Pass excluded URL data to the API.
9 lat temu
Ben Kurtovic
321e6e3352
Support new 'excluded' column in database.
9 lat temu
Ben Kurtovic
4092cbc0f6
Keep all workers' backgrounds in sync, and ensure they update at 00:00 UTC.
9 lat temu
Ben Kurtovic
b325ef84cd
Support general/update notices.
9 lat temu
Ben Kurtovic
1327f57404
Fix backgrounds when the image has been moved.
9 lat temu
Ben Kurtovic
f2c1d5b853
Make cache.langs, cache.projects sorted lists.
10 lat temu
Ben Kurtovic
0f4ecfe80a
Auto-bypass cache when cache_possible_miss is True.
10 lat temu
Ben Kurtovic
81552f28d3
Bugfix; missed column.
10 lat temu
Ben Kurtovic
10e75c7b7a
Cache 'possible_miss' result value.
10 lat temu
Ben Kurtovic
331b05baae
Truncate URLs above 1024 chars.
10 lat temu
Ben Kurtovic
a07d12699a
Finally fix #3 ; speed up highlighter with a deque.
10 lat temu
Ben Kurtovic
f4ff4099be
Fix old reference to query.all_projects.
10 lat temu
Ben Kurtovic
a1d8b79265
Fix recursion bug.
10 lat temu
Ben Kurtovic
6eed95c761
Use a global cache instead of the request-specific one.
10 lat temu
Ben Kurtovic
ddcf2a8963
Store what was previously in SQL tables in flask.g.
10 lat temu
Ben Kurtovic
9b07a5e9dc
OrderedDict requires extra parens.
10 lat temu
Ben Kurtovic
340ba07c9d
Generate API results as ordered dicts.
10 lat temu
Ben Kurtovic
6d87ad41ed
Rearrange functions in api.py.
10 lat temu
Ben Kurtovic
2fad6b0b60
Description of API response data.
10 lat temu
Ben Kurtovic
89feddba98
Some minor tweaks, plus documentation for API parameters.
10 lat temu
Ben Kurtovic
6639338fa8
First version of new API (for #7 )
10 lat temu