Ben Kurtovic
4fff908912
Cleanup; fix site locking mechanism; badtoken handling.
11 years ago
Ben Kurtovic
1bfe5c2ee7
Fix error handling in get_token().
11 years ago
Ben Kurtovic
3d5be6c627
Fix token-getter code and refactor into a get_token() function.
11 years ago
Ben Kurtovic
08af882841
Update AssertEdit behavior now that it's been merged into MW core.
Note: don't deploy until January 14.
11 years ago
Ben Kurtovic
39d5c7c149
Update copyright notices for 2014.
11 years ago
Ben Kurtovic
ed95c99f0e
Update email address.
11 years ago
Ben Kurtovic
a3eee798f9
Implement Category.__iter__()
11 years ago
Ben Kurtovic
a926431754
Better client-side buffering of SQL results.
11 years ago
Ben Kurtovic
b88181bb24
Make exclusion check a bit smarter.
11 years ago
Ben Kurtovic
48e6c14ab8
WHOOPS X2.
11 years ago
Ben Kurtovic
fd5852a6d0
RIP Toolserver. (Some fixes for Labs compatibility.)
11 years ago
Ben Kurtovic
5931f375de
Put response.read() in the try:, since that's what throws the timeout.
11 years ago
Ben Kurtovic
f082fca720
So re.MULTILINE is necessary for those rules.
12 years ago
Ben Kurtovic
333201226b
Once again, reengineer regexes.
12 years ago
Ben Kurtovic
8c524d9c30
Whoops, those patterns are supposed to be ignored.
12 years ago
Ben Kurtovic
ab216fb834
<nowiki> tags in exclusion lists are optional.
12 years ago
Ben Kurtovic
0b7a13eca5
Update copyright notices for 2013.
12 years ago
Ben Kurtovic
6a381c66f1
Fix a bug regarding updating the 'all' site.
12 years ago
Ben Kurtovic
c36b45771a
Make [[User:EarwigBot/Copyvios/Exclusions]] project-insensitive.
12 years ago
Ben Kurtovic
4ff7612a27
Fix when we've cached None as a url.
12 years ago
Ben Kurtovic
25d75c5d11
Forgot to give process_time param to CopyvioCheckResult.
12 years ago
Ben Kurtovic
8862bec3d9
Fix statements assigned to nothing.
12 years ago
Ben Kurtovic
bcf9b70107
Keep track of how long generating results takes; support 'max_time'.
12 years ago
Ben Kurtovic
de23112f43
Bugfix when we can't read HTML.
12 years ago
Ben Kurtovic
a4dda89a61
Various fixes for copyvios.
- Fix a bug in ExclusionsDB; improve URL regexes.
- NLTK's LookupError is actually an IOError.
- Fix bug in __repr__ for CopyvioCheckResult.
- Rewrite YahooBOSSSearchEngine to actually work with oauth2.
- Search engines now take a URL opener in addition to credentials.
12 years ago
Ben Kurtovic
d709ee00a7
Better handling when we are blocked from editing.
12 years ago
Ben Kurtovic
655624c2cf
Lazy-importing of py-bcrypt and pycrypto; restructured deps in setup.
12 years ago
Ben Kurtovic
ac6de461bb
Implement lazy-importing of oursql and pytz.
12 years ago
Ben Kurtovic
0ca84ab9bc
Implement lazy-importing of oauth2, nltk, and bs4.
12 years ago
Ben Kurtovic
c0e309bbd9
Simplify a bit of code in Page
12 years ago
Ben Kurtovic
a7e243fdfb
Update mwparserfromhell usage to reflect current API.
12 years ago
Ben Kurtovic
4baab6f57c
Implement lazy importing of root-level modules and packages.
- Simplify all imports
- Update dependency version in setup.py
- Change waitTime default from three seconds to two
12 years ago
Ben Kurtovic
8d8703358c
More fixes and tweaks; cleanup; etc.
12 years ago
Ben Kurtovic
2d4b31cde9
OrderedLoader/OrderedDumper to... preserve order... plus some cleanup.
12 years ago
Ben Kurtovic
7d7205265e
Implement the rest of _set_wiki(); _ask(); other tweaks.
12 years ago
Ben Kurtovic
d4e9106667
Implement remove_admin()/owner(); fixing some other things.
12 years ago
Ben Kurtovic
fc610867e9
Implement User.is_up; two bugfixes/enhancements in the clerkbot.
12 years ago
Ben Kurtovic
7d52d4558a
Some updates to !lag.
12 years ago
Ben Kurtovic
33aa1d6744
Collapse extra newlines to avoid distorting trigrams.
12 years ago
Ben Kurtovic
c19a1bde2e
Determine which cookies apply to which domains better.
12 years ago
Ben Kurtovic
3f1737717c
More accurate handling when using get_site(lang, project).
Some wikis (like frwiki) set their projects to localized forms of
the expected name (so "wikipedia" becomes "wikipédia"). An additional
search by URL in _get_site_name_from_sitesdb() fixes this.
12 years ago
Ben Kurtovic
46c3561ce8
Apparently cookie values are plus-quoted. Oops!
12 years ago
Ben Kurtovic
b784fc7cd8
Fix a silly bug in passing cookiejars to Site objects.
* Apparently bool(cookiejar) is False if the cookiejar doesn't
contain any cookies, even if it exists. Change `if cookiejar` to
`if cookiejar is not None` to get around this.
12 years ago
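The cookiejar pitfall described in the commit above can be reproduced in a few lines. This is only a sketch, assuming Python 3's `http.cookiejar` (the original code likely used the Python 2 `cookielib` module); `pick_cookiejar` is a hypothetical helper for illustration, not a function from the repository.

```python
from http.cookiejar import CookieJar


def pick_cookiejar(cookiejar=None):
    """Return the caller's jar if one was passed, otherwise a fresh one.

    Hypothetical helper. CookieJar defines __len__, so an empty jar is
    falsy; testing `if cookiejar:` would discard a valid but empty jar.
    """
    if cookiejar is not None:
        return cookiejar
    return CookieJar()


empty_jar = CookieJar()
# An existing jar with no cookies still evaluates to False.
empty_is_truthy = bool(empty_jar)
# The identity check keeps the caller's jar even though it is empty.
kept_same_jar = pick_cookiejar(empty_jar) is empty_jar
```

With `if cookiejar:` in place of the identity check, the empty jar would be silently replaced, and cookies stored later would not land in the jar the caller holds.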
Ben Kurtovic
c198ce27a7
Tuple, not list.
12 years ago
Ben Kurtovic
e71e9daadb
Store update_time as an int, not a float.
12 years ago
Ben Kurtovic
fcd7fddbdd
MD5 can't handle Unicode, so fully implement saving Unicode objects by fixing that.
12 years ago
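A common way to handle the MD5/Unicode problem mentioned in the commit above is to encode text to bytes before hashing. The sketch below assumes Python 3 and UTF-8; `md5_hex` is a hypothetical name, and the actual fix in the repository may look different.

```python
import hashlib


def md5_hex(text):
    """Return the hex MD5 digest of text, accepting str or bytes.

    Hypothetical helper. hashlib.md5 only accepts bytes-like input,
    so str values are encoded as UTF-8 first (an assumed encoding).
    """
    if isinstance(text, str):
        text = text.encode("utf-8")
    return hashlib.md5(text).hexdigest()


digest = md5_hex("wikipédia")
```

Encoding at a single point keeps the digest stable no matter whether callers pass text or already-encoded bytes.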
Ben Kurtovic
f993b847ab
Encode URLs as UTF-8 before opening them.
12 years ago
Ben Kurtovic
e1ff76ba9f
gettattr -> getattr
12 years ago
Ben Kurtovic
482db43b2e
Improve assert=bot.
12 years ago
Ben Kurtovic
570168ed0e
Institute a timeout so we don't try to open these suspicious URLs forever.
12 years ago