Ben Kurtovic
86a8440730
Moving parsers to own file.
vor 12 Jahren
Ben Kurtovic
d4e947b98b
earwigbot.wiki.copyvios.search module split
vor 12 Jahren
Ben Kurtovic
e6a381f3f7
Restructuring copyvio stuff as its own package.
vor 12 Jahren
Ben Kurtovic
9434a416a1
Moved search engine/credential info into config proper.
- In config.json, search config relocated from
tasks.afc_copyvios to wiki.
- Site.__init__() takes a `search_config' argument, which is
auto-supplied from its value in config.json by get_site().
- Page.copyvio_check() doesn't ask for search config
anymore, meaning doing checks from the command line
is less painful.
- Added a Page.copyvio_compare() function, which works
just like copyvio_check() but on a specified URL; this is
for cache retrieval on the web front-end.
vor 12 Jahren
Ben Kurtovic
7cc85f9bc4
afc_copyvios: optionally cache results for the Toolserver.
vor 12 Jahren
Ben Kurtovic
f382ceb38e
Pushing some smarter logic for MarkovChains
- Incomplete; need this for the TS rewrite
- Also starting work on docstrings for some methods
vor 12 Jahren
Ben Kurtovic
7e6f1e8128
soxred93 -> tparis; toolserver account expired
vor 12 Jahren
Ben Kurtovic
755dff9714
Copyvios: auto-fail very small articles (< 20 chain links)
vor 12 Jahren
Ben Kurtovic
6009c050f9
Minor integer division fix.
vor 12 Jahren
Ben Kurtovic
df7868da3e
Updates to copyright violation stuff.
vor 12 Jahren
Ben Kurtovic
ee2b1133bb
Algorithm for comparing article content against a suspected source using MarkovChains
vor 13 Jahren
Ben Kurtovic
2da906109b
Copyright update for 2012.
vor 13 Jahren
Ben Kurtovic
13100533b9
CopyrightMixin needs Page._site
vor 13 Jahren
Ben Kurtovic
c48073515b
#wikipedia-en-afc -> #wikipedia-en-afc-feed
vor 13 Jahren
Ben Kurtovic
24f7eabb77
Some more work on copyvio detection code
Also removed the hardcoded version in user-agent strings.
vor 13 Jahren
Ben Kurtovic
56e6140284
More work on copyright violation detection code.
vor 13 Jahren
Ben Kurtovic
0b6d5eac5e
Some code for copyvio detection, including querying Yahoo! BOSS correctly.
vor 13 Jahren
Ben Kurtovic
42081ab1c7
Reorder formatting so we don't destroy our datetime object before we're done with it.
vor 13 Jahren
Ben Kurtovic
53533e1821
Support for a hidden date sortkey.
Also fixing a bug in get_notes() with regards to the 'old' note/warning.
vor 13 Jahren
Ben Kurtovic
15748c6edc
Fixing unit test code.
vor 13 Jahren
Ben Kurtovic
27867763eb
Disabling auto-linker.
vor 13 Jahren
Ben Kurtovic
bff00f9b28
Restruturing codebase to be a bit more Pythonic.
vor 13 Jahren
Ben Kurtovic
dbe9b57153
Sleep a bit more logically, adjust color.
vor 13 Jahren
Ben Kurtovic
8f2b82b254
Wrong axis!
vor 13 Jahren
Ben Kurtovic
3c343a175b
Looks like the axis kwarg was added to Axes matlab>1.0.1
vor 13 Jahren
Ben Kurtovic
3858692247
Fix.
vor 13 Jahren
Ben Kurtovic
63a30d2247
Prettify chart a bit.
vor 13 Jahren
Ben Kurtovic
bb6e8f1063
Fix bottom kwarg for p3.
vor 13 Jahren
Ben Kurtovic
b313412905
Fixes/improvements.
vor 13 Jahren
Ben Kurtovic
2f86fe5b07
Fix data reversal.
vor 13 Jahren
Ben Kurtovic
85af03df25
self.dest -> self.destination
vor 13 Jahren
Ben Kurtovic
1701016a7d
Graphing with matplotlib.
vor 13 Jahren
Ben Kurtovic
149fe8fdb4
Fix.
vor 13 Jahren
Ben Kurtovic
4eca6a83e7
Forgot to int() custom num_days; fix generate().
vor 13 Jahren
Ben Kurtovic
522e44abc4
Oops.
vor 13 Jahren
Ben Kurtovic
ed8751970f
Fix update_date() when a submission is in multiple date categories.
vor 13 Jahren
Ben Kurtovic
5fc281336a
Fix query in get_status().
vor 13 Jahren
Ben Kurtovic
656d904515
Sleep a bit in between API queries.
vor 13 Jahren
Ben Kurtovic
396a21c4a8
CHART_ -> STATUS_
vor 13 Jahren
Ben Kurtovic
6bb438b60b
Fixes.
vor 13 Jahren
Ben Kurtovic
13476460c9
Replag command; fix.
vor 13 Jahren
Ben Kurtovic
a9f07c2611
Forgot import config >_>
vor 13 Jahren
Ben Kurtovic
d949269944
Some SQL updates, starting work on afc_history task.
* get() -> return a Task instance by name (tasks)
* Using SQL to save API queries. (commands.{afc_report,afc_status})
* ignore_list -> ignoreList in config. (tasks.afc_statistics)
vor 13 Jahren
Ben Kurtovic
3e5801d5e6
!status nocolor for HallowsAG
vor 13 Jahren
Ben Kurtovic
5088315db4
Differentiate between completely blank pages and nonexistant or unretrievable pages.
vor 13 Jahren
Ben Kurtovic
609b1cce75
Misplaced submissions can't be 'resumbmitted'; page_special_user is only significant for CHART_PEND and CHART_DRAFT.
vor 13 Jahren
Ben Kurtovic
7757af58fd
Fix; get_size() rounds to tenths instead of hundreds.
vor 13 Jahren
Ben Kurtovic
1c850764d7
Fixes in Unicode, status determination; removed created.
vor 13 Jahren
Ben Kurtovic
9ec03520b3
Fix.
vor 13 Jahren
Ben Kurtovic
e02ff2d5db
Fix get_status_and_chart(content, namespace)
vor 13 Jahren