47 Commits (afc5980683d4e0949d07fea0e3837b8b7622267f)

Author SHA1 Message Date
  Ben Kurtovic afc5980683 Rewrite much of the indexer to use GitPython. 10 years ago
  Ben Kurtovic 3dbdf86ff7 Fix. 10 years ago
  Ben Kurtovic 4c6f4039a2 Ugly, but fixes a crawler threading bug. 10 years ago
  Ben Kurtovic ac5f0981cc Rearrange sleep. 10 years ago
  Ben Kurtovic 1c0c4104e5 Support crawling specific repos; add some logging. 10 years ago
  Ben Kurtovic bf25b3af66 Don't configure logging twice. 10 years ago
  Ben Kurtovic 53a8ad91fa Fix for symbol locs. 10 years ago
  Ben Kurtovic 91ab08f99c Try something. 10 years ago
  Ben Kurtovic d609c233a1 Attempt to fix /tmp race condition. 10 years ago
  Ben Kurtovic 11b460eaa0 Fix repo names. 10 years ago
  Ben Kurtovic e77de2305c Start working on new language system. 10 years ago
  Benjamin Attal 2d643b1069 Stop ruby parser from failing. Add other parser fixes. Should be good 10 years ago
  Ben Kurtovic ddcb5b221f Use logs to calculate ranks (closes #61). 10 years ago
  Ben Kurtovic 10e7491a40 Fix indexer breaking http:// URLs. 10 years ago
  Ben Kurtovic 1015298109 Make it easy to stop crawler/parsers. Cleanup. 10 years ago
  Benjamin Attal 4202552a1e Remove unecessary import 10 years ago
  Benjamin Attal 21cf52ea65 Call start_parse_servers from crawl.py 10 years ago
  Ben Kurtovic f02dc4497c Fixes. 10 years ago
  Severyn Kozak 94953624c8 Fix #34. 10 years ago
  Ben Kurtovic 5a83720617 Strip encoding lines. 10 years ago
  Severyn Kozak fc8d478060 Untested fix #33. 10 years ago
  Ben Kurtovic a3eacc287e Try to make exception reporting more useful. 10 years ago
  Ben Kurtovic 9f935bbb74 This is ugly, but it improves the current setup. 10 years ago
  Severyn Kozak b698a16c98 Add parse() and insert() calls to crawler. 10 years ago
  Severyn Kozak f8436fa484 Part of #26. Move __init__.py to crawl.py. 10 years ago
  Severyn Kozak 7c5c9fc7e1 Add GitHub stars, Bitbucket watchers; close #14. 10 years ago
  Severyn Kozak d142f1fd55 Complete Crawler. Close #15, #14, #11, #8. 10 years ago
  Severyn Kozak 6762c1fa3d Re-add logging, rem file filters. 10 years ago
  Severyn Kozak 1b2739f8c4 Add GitHub repo star count, simple logging. 10 years ago
  Severyn Kozak ad7ce9d9cf Commit latest crawler, continue fix of #8. 10 years ago
  Severyn Kozak f38772760b Remove some subprocesses, comment out logging. 10 years ago
  Severyn Kozak 2954161747 Add partially integrated BitbucketCrawler(). 10 years ago
  Severyn Kozak 93ed68645d Add partially integrated BitbucketCrawler(). 10 years ago
  Severyn Kozak 6718650a8c First part of #8 fix. 10 years ago
  Severyn Kozak 3ce399adbf Add threaded cloner, GitRepository class (#7). 10 years ago
  Severyn Kozak 755dce6ae3 Add logging to crawler/indexer. 10 years ago
  Severyn Kozak f4b28e6178 Add file-ext regex rules, exception handlers. 10 years ago
  Severyn Kozak 627c848f20 Add tested indexer. 10 years ago
  Severyn Kozak b680756f8d Test crawler, complete documentation. 10 years ago
  Severyn Kozak b7ccec0501 Add untested threaded indexer/crawler prototype. 10 years ago
  Severyn Kozak 97198ee523 Update Crawler documentation. 10 years ago
  Severyn Kozak c655d97f48 Add class ChangeDir, amend unsafe subprocess. 10 years ago
  Severyn Kozak 9fc4598001 Clean up crawler/, fix minor bugs. 10 years ago
  Severyn Kozak 77b448c3de Mod Codelet, mov codelet creation from crawler. 10 years ago
  Severyn Kozak ef9c0609fe Mov author_files > git_inder, heavily refactor. 10 years ago