47 Commity (afc5980683d4e0949d07fea0e3837b8b7622267f)

Autor SHA1 Wiadomość Data
  Ben Kurtovic afc5980683 Rewrite much of the indexer to use GitPython. 10 lat temu
  Ben Kurtovic 3dbdf86ff7 Fix. 10 lat temu
  Ben Kurtovic 4c6f4039a2 Ugly, but fixes a crawler threading bug. 10 lat temu
  Ben Kurtovic ac5f0981cc Rearrange sleep. 10 lat temu
  Ben Kurtovic 1c0c4104e5 Support crawling specific repos; add some logging. 10 lat temu
  Ben Kurtovic bf25b3af66 Don't configure logging twice. 10 lat temu
  Ben Kurtovic 53a8ad91fa Fix for symbol locs. 10 lat temu
  Ben Kurtovic 91ab08f99c Try something. 10 lat temu
  Ben Kurtovic d609c233a1 Attempt to fix /tmp race condition. 10 lat temu
  Ben Kurtovic 11b460eaa0 Fix repo names. 10 lat temu
  Ben Kurtovic e77de2305c Start working on new language system. 10 lat temu
  Benjamin Attal 2d643b1069 Stop ruby parser from failing. Add other parser fixes. Should be good 10 lat temu
  Ben Kurtovic ddcb5b221f Use logs to calculate ranks (closes #61). 10 lat temu
  Ben Kurtovic 10e7491a40 Fix indexer breaking http:// URLs. 10 lat temu
  Ben Kurtovic 1015298109 Make it easy to stop crawler/parsers. Cleanup. 10 lat temu
  Benjamin Attal 4202552a1e Remove unecessary import 10 lat temu
  Benjamin Attal 21cf52ea65 Call start_parse_servers from crawl.py 10 lat temu
  Ben Kurtovic f02dc4497c Fixes. 10 lat temu
  Severyn Kozak 94953624c8 Fix #34. 10 lat temu
  Ben Kurtovic 5a83720617 Strip encoding lines. 10 lat temu
  Severyn Kozak fc8d478060 Untested fix #33. 10 lat temu
  Ben Kurtovic a3eacc287e Try to make exception reporting more useful. 10 lat temu
  Ben Kurtovic 9f935bbb74 This is ugly, but it improves the current setup. 10 lat temu
  Severyn Kozak b698a16c98 Add parse() and insert() calls to crawler. 10 lat temu
  Severyn Kozak f8436fa484 Part of #26. Move __init__.py to crawl.py. 10 lat temu
  Severyn Kozak 7c5c9fc7e1 Add GitHub stars, Bitbucket watchers; close #14. 10 lat temu
  Severyn Kozak d142f1fd55 Complete Crawler. Close #15, #14, #11, #8. 10 lat temu
  Severyn Kozak 6762c1fa3d Re-add logging, rem file filters. 10 lat temu
  Severyn Kozak 1b2739f8c4 Add GitHub repo star count, simple logging. 10 lat temu
  Severyn Kozak ad7ce9d9cf Commit latest crawler, continue fix of #8. 10 lat temu
  Severyn Kozak f38772760b Remove some subprocesses, comment out logging. 10 lat temu
  Severyn Kozak 2954161747 Add partially integrated BitbucketCrawler(). 10 lat temu
  Severyn Kozak 93ed68645d Add partially integrated BitbucketCrawler(). 10 lat temu
  Severyn Kozak 6718650a8c First part of #8 fix. 10 lat temu
  Severyn Kozak 3ce399adbf Add threaded cloner, GitRepository class (#7). 10 lat temu
  Severyn Kozak 755dce6ae3 Add logging to crawler/indexer. 10 lat temu
  Severyn Kozak f4b28e6178 Add file-ext regex rules, exception handlers. 10 lat temu
  Severyn Kozak 627c848f20 Add tested indexer. 10 lat temu
  Severyn Kozak b680756f8d Test crawler, complete documentation. 10 lat temu
  Severyn Kozak b7ccec0501 Add untested threaded indexer/crawler prototype. 10 lat temu
  Severyn Kozak 97198ee523 Update Crawler documentation. 10 lat temu
  Severyn Kozak c655d97f48 Add class ChangeDir, amend unsafe subprocess. 10 lat temu
  Severyn Kozak 9fc4598001 Clean up crawler/, fix minor bugs. 10 lat temu
  Severyn Kozak 77b448c3de Mod Codelet, mov codelet creation from crawler. 10 lat temu
  Severyn Kozak ef9c0609fe Mov author_files > git_inder, heavily refactor. 10 lat temu