Ben Kurtovic
|
4e8be871b7
|
Update copyright year for 2015.
|
9 yıl önce |
Ben Kurtovic
|
0bdcbca8b0
|
Rudimentary solution for PDF parsing (closes earwig/copyvios#18)
|
10 yıl önce |
Ben Kurtovic
|
30f72df470
|
Refactor parsers; fix empty document behavior.
|
10 yıl önce |
Ben Kurtovic
|
5349179088
|
Fix parsing of plain text documents (earwig/copyvios#3)
|
10 yıl önce |
Ben Kurtovic
|
7137dda920
|
Update copyvio checker to not make concurrent requests to a single domain.
|
10 yıl önce |
Ben Kurtovic
|
39d5c7c149
|
Update copyright notices for 2014.
|
11 yıl önce |
Ben Kurtovic
|
ed95c99f0e
|
Update email address.
|
11 yıl önce |
Ben Kurtovic
|
c3dbc9cc84
|
Missed a few years.
|
11 yıl önce |
Ben Kurtovic
|
655624c2cf
|
Lazy-importing of py-bcrypt and pycrypto; restructured deps in setup.
|
12 yıl önce |
Ben Kurtovic
|
9fac24b763
|
Support passing kwargs to tasks in util, plus some other updates.
|
12 yıl önce |
Ben Kurtovic
|
4baab6f57c
|
Implement lazy importing of root-level modules and packages.
- Simplify all imports
- Update dependency version in setup.py
- Change waitTime default from three seconds to two
|
12 yıl önce |
Ben Kurtovic
|
569c815d99
|
Implement NLTK for chunking article content (#5).
|
12 yıl önce |
Ben Kurtovic
|
cb87004107
|
Primitive screen scraper for HTML using BeautifulSoup and LXML.
Obviously this can and should be improved significantly later, but it seems
good enough for now.
|
12 yıl önce |
Ben Kurtovic
|
6be8fb48dd
|
Cleaning up project-wide copyright notices.
|
12 yıl önce |
Ben Kurtovic
|
e9be04de37
|
Fix to finish feature/commands merge (#28).
|
12 yıl önce |
Ben Kurtovic
|
4b1d745e2c
|
Handle timezones correctly with pytz.
|
12 yıl önce |
Ben Kurtovic
|
ccc0440ebe
|
Massively shorten README; integrate Page with mwparserfromhell.
|
12 yıl önce |
Ben Kurtovic
|
e67ea9a91f
|
Replacing custom Blowfish implementation with pycrypto
|
12 yıl önce |
Ben Kurtovic
|
362db3d1c8
|
git version info, logging updates, utility
|
12 yıl önce |
Ben Kurtovic
|
54739d5553
|
Update references to Wiki Toolset, plus other fixes
|
12 yıl önce |
Ben Kurtovic
|
e473653507
|
Updates to setup.py, unit tests, and logging
|
12 yıl önce |
Ben Kurtovic
|
45a472c2ef
|
setup.py should be good now
|
12 yıl önce |
Ben Kurtovic
|
abe58a07f6
|
CommandManager as attr of Bot, plus cleanup
|
12 yıl önce |
Ben Kurtovic
|
117eccc35d
|
Beginning work (#16)
|
12 yıl önce |
Ben Kurtovic
|
65a01c66b5
|
Removed earwigbot.rules; moved its contents to config.yml
|
12 yıl önce |
Ben Kurtovic
|
424a954af6
|
Moving rest of earwigbot.classes to earwigbot.tasks
|
12 yıl önce |
Ben Kurtovic
|
e54db76757
|
Cleanup; ensure unit tests are functional
|
12 yıl önce |
Ben Kurtovic
|
2da906109b
|
Copyright update for 2012.
|
13 yıl önce |
Ben Kurtovic
|
bff00f9b28
|
Restruturing codebase to be a bit more Pythonic.
|
13 yıl önce |