Ben Kurtovic
b9074c9f9d
URL exclusions: fix uppercase characters in patterns never matching
4 роки тому
Ben Kurtovic
88f9c21111
URL exclusions: fix comment parsing
5 роки тому
Ben Kurtovic
1cdc0a5a4c
Improve excluded URL list parsing
5 роки тому
Ben Kurtovic
69cdb41d07
Adjust mirror hints to include direct links back to the article.
8 роки тому
Ben Kurtovic
b4b079ffd0
Update copyright year for 2016.
8 роки тому
Ben Kurtovic
c81d1d949d
Update global exclusion lists more often than site-specific ones.
8 роки тому
Ben Kurtovic
108eca13ac
Finish mirror hinting algorithm.
8 роки тому
Ben Kurtovic
91846ce4fb
Refactor out mirror hinting logic in source parsers.
8 роки тому
Ben Kurtovic
03910b6cb5
Add mirror detection logic to parsers; fixes.
8 роки тому
Ben Kurtovic
4e8be871b7
Update copyright year for 2015.
9 роки тому
Ben Kurtovic
09319b1675
Don't die on broken regexes.
9 роки тому
Ben Kurtovic
4cdfafd487
Skip site check.
9 роки тому
Ben Kurtovic
4075d887e9
Fix return.
9 роки тому
Ben Kurtovic
a2c10650a8
Add support for User:EranBot/Copyright/Blacklist ( closes #52 )
9 роки тому
Ben Kurtovic
b87d5ac673
Pass parameter to recursive call.
9 роки тому
Ben Kurtovic
170f810735
Allow ExclusionDB to force a sync.
9 роки тому
Ben Kurtovic
7afb484cea
Refactor a bunch of copyvio internals. Store all sources with a result object.
9 роки тому
Ben Kurtovic
3a8349e8ab
Allow regexes in exclusion lists.
10 роки тому
Ben Kurtovic
39d5c7c149
Update copyright notices for 2014.
10 роки тому
Ben Kurtovic
ed95c99f0e
Update email address.
10 роки тому
Ben Kurtovic
b88181bb24
Make exclusion check a bit smarter.
10 роки тому
Ben Kurtovic
f082fca720
So re.MULTILINE is necessary for those rules.
11 роки тому
Ben Kurtovic
333201226b
Once again, reengineer regexes.
11 роки тому
Ben Kurtovic
8c524d9c30
Whoops, those patterns are supposed to be ignored.
11 роки тому
Ben Kurtovic
ab216fb834
<nowiki> tags in exclusion lists are optional.
11 роки тому
Ben Kurtovic
0b7a13eca5
Update copyright notices for 2013.
11 роки тому
Ben Kurtovic
6a381c66f1
Fix a bug regarding updating the 'all' site.
11 роки тому
Ben Kurtovic
c36b45771a
Make [[User:EarwigBot/Copyvios/Exclusions]] project-insensitive.
11 роки тому
Ben Kurtovic
a4dda89a61
Various fixes for copyvios.
- Fix a bug in ExclusionsDB; improve URL regexes.
- NLTK's LookupError is actually an IOError.
- Fix bug in __repr__ for CopyvioCheckResult.
- Rewrite YahooBOSSSearchEngine to actually work with oauth2.
- Search engines now take a URL opener in addition to credentials.
11 роки тому
Ben Kurtovic
d4e9106667
Implement remove_admin()/owner(); fixing some other things.
12 роки тому
Ben Kurtovic
e71e9daadb
Store update_time as an int, not a float.
12 роки тому
Ben Kurtovic
37336fc343
Whoops, use correct filename.
12 роки тому
Ben Kurtovic
292f04fe2d
Implemented limited wildcard matching in ExceptionsDB.
12 роки тому
Ben Kurtovic
5ab736723b
Fixing a number of silly mistakes; refactoring (thanks pylint)
12 роки тому
Ben Kurtovic
439b855254
Fully implement logging; fix non-unicode log messages.
12 роки тому
Ben Kurtovic
1c2dcc999a
__repr__ and __str__ for ExclusionsDB ( #5 ).
12 роки тому
Ben Kurtovic
a074da853b
More work on copyvios, including an exclusions database ( #5 )
* Added exclusions module with a fully implemented ExclusionsDB that can pull
from multiple sources for different sites.
* Moved CopyvioCheckResult to its own module, to be imported by __init__.
* Some other related changes.
12 роки тому