Ben Kurtovic
d4e947b98b
earwigbot.wiki.copyvios.search module split
12 years ago
Ben Kurtovic
ec8df97616
Cleanup a few instances of string concatenation
12 years ago
Ben Kurtovic
cdb7bce734
Avoid backreferences in re.sub() repl to bypass re._cache_repl ( #17 )
12 years ago
Ben Kurtovic
e6a381f3f7
Restructuring copyvio stuff as its own package.
12 years ago
Ben Kurtovic
9434a416a1
Moved search engine/credential info into config proper.
- In config.json, search config relocated from
tasks.afc_copyvios to wiki.
- Site.__init__() takes a `search_config' argument, which is
auto-supplied from its value in config.json by get_site().
- Page.copyvio_check() doesn't ask for search config
anymore, meaning doing checks from the command line
is less painful.
- Added a Page.copyvio_compare() function, which works
just like copyvio_check() but on a specified URL; this is
for cache retrieval on the web front-end.
12 years ago
Ben Kurtovic
7cc85f9bc4
afc_copyvios: optionally cache results for the Toolserver.
12 years ago
Ben Kurtovic
f382ceb38e
Pushing some smarter logic for MarkovChains
- Incomplete; need this for the TS rewrite
- Also starting work on docstrings for some methods
12 years ago
Ben Kurtovic
7e6f1e8128
soxred93 -> tparis; toolserver account expired
12 years ago
Ben Kurtovic
755dff9714
Copyvios: auto-fail very small articles (< 20 chain links)
12 years ago
Ben Kurtovic
6009c050f9
Minor integer division fix.
12 years ago
Ben Kurtovic
df7868da3e
Updates to copyright violation stuff.
12 years ago
Ben Kurtovic
ee2b1133bb
Algorithm for comparing article content against a suspected source using MarkovChains
13 years ago
Ben Kurtovic
2da906109b
Copyright update for 2012.
13 years ago
Ben Kurtovic
13100533b9
CopyrightMixin needs Page._site
13 years ago
Ben Kurtovic
c48073515b
#wikipedia-en-afc -> #wikipedia-en-afc-feed
13 years ago
Ben Kurtovic
24f7eabb77
Some more work on copyvio detection code
Also removed the hardcoded version in user-agent strings.
13 years ago
Ben Kurtovic
56e6140284
More work on copyright violation detection code.
13 years ago
Ben Kurtovic
0b6d5eac5e
Some code for copyvio detection, including querying Yahoo! BOSS correctly.
13 years ago
Ben Kurtovic
42081ab1c7
Reorder formatting so we don't destroy our datetime object before we're done with it.
13 years ago
Ben Kurtovic
53533e1821
Support for a hidden date sortkey.
Also fixing a bug in get_notes() with regards to the 'old' note/warning.
13 years ago
Ben Kurtovic
15748c6edc
Fixing unit test code.
13 years ago
Ben Kurtovic
27867763eb
Disabling auto-linker.
13 years ago
Ben Kurtovic
bff00f9b28
Restruturing codebase to be a bit more Pythonic.
13 years ago
Ben Kurtovic
dbe9b57153
Sleep a bit more logically, adjust color.
13 years ago
Ben Kurtovic
8f2b82b254
Wrong axis!
13 years ago
Ben Kurtovic
3c343a175b
Looks like the axis kwarg was added to Axes matlab>1.0.1
13 years ago
Ben Kurtovic
3858692247
Fix.
13 years ago
Ben Kurtovic
63a30d2247
Prettify chart a bit.
13 years ago
Ben Kurtovic
bb6e8f1063
Fix bottom kwarg for p3.
13 years ago
Ben Kurtovic
b313412905
Fixes/improvements.
13 years ago
Ben Kurtovic
2f86fe5b07
Fix data reversal.
13 years ago
Ben Kurtovic
85af03df25
self.dest -> self.destination
13 years ago
Ben Kurtovic
1701016a7d
Graphing with matplotlib.
13 years ago
Ben Kurtovic
149fe8fdb4
Fix.
13 years ago
Ben Kurtovic
4eca6a83e7
Forgot to int() custom num_days; fix generate().
13 years ago
Ben Kurtovic
522e44abc4
Oops.
13 years ago
Ben Kurtovic
ed8751970f
Fix update_date() when a submission is in multiple date categories.
13 years ago
Ben Kurtovic
5fc281336a
Fix query in get_status().
13 years ago
Ben Kurtovic
656d904515
Sleep a bit in between API queries.
13 years ago
Ben Kurtovic
396a21c4a8
CHART_ -> STATUS_
13 years ago
Ben Kurtovic
6bb438b60b
Fixes.
13 years ago
Ben Kurtovic
13476460c9
Replag command; fix.
13 years ago
Ben Kurtovic
a9f07c2611
Forgot import config >_>
13 years ago
Ben Kurtovic
d949269944
Some SQL updates, starting work on afc_history task.
* get() -> return a Task instance by name (tasks)
* Using SQL to save API queries. (commands.{afc_report,afc_status})
* ignore_list -> ignoreList in config. (tasks.afc_statistics)
13 years ago
Ben Kurtovic
3e5801d5e6
!status nocolor for HallowsAG
13 years ago
Ben Kurtovic
5088315db4
Differentiate between completely blank pages and nonexistant or unretrievable pages.
13 years ago
Ben Kurtovic
609b1cce75
Misplaced submissions can't be 'resumbmitted'; page_special_user is only significant for CHART_PEND and CHART_DRAFT.
13 years ago
Ben Kurtovic
7757af58fd
Fix; get_size() rounds to tenths instead of hundreds.
13 years ago
Ben Kurtovic
1c850764d7
Fixes in Unicode, status determination; removed created.
13 years ago
Ben Kurtovic
9ec03520b3
Fix.
13 years ago