2d7c1f4
Fix array out of bounds exception coming from JavaParser.java by
2014-04-20 13:22:15 -0400
64ef9b0
Remove unecessary imports by
2014-04-20 12:22:22 -0400
f451e42
Refactor of the Java Parser Mod: Parser.java: - Moved client reading and writing methods to the abstract parser class, so that it is not specific to the JavaParser by
2014-04-20 12:14:38 -0400
2338887
Working version of java parser up and running. by
2014-04-20 02:16:06 -0400
19a5457
Change director structure for java by
2014-04-20 01:42:57 -0400
306875d
Make Parser implement runnable so parsing tasks can be started in separate threads. Make Parser constructor accept a client socket, add reading and writing methods for the socket to JavaParser. Parse main method sets up a server for accepting parse jobs from the crawler, and starts threads for each parse task. by
2014-04-20 01:41:55 -0400
77e2b6f
Fix errors in java parser, mostly casting issues. In Parse.java, set up a tcp server for communication with python processes. Builds with maven by
2014-04-19 15:26:38 -0400
669c30c
Mod: Parse.java: Added comments by
2014-04-18 22:13:21 -0400
63b09ca
Changed directory structure of java parser. Decided on multiple parsers in different languages, refactored bitshift/parser to fit with that paradigm. by
2014-04-18 19:40:31 -0400
a1066dd
Modify parser/__init__.py so that it communicates with the Java parsing process and reads a result back from a unique file. Add template files for Java parsers. by
2014-04-18 16:31:43 -0400
3bc7482
Refactor parser/__init__.py for new parsing mechanism by
2014-04-18 15:45:12 -0400
430b7d3
Remove unecessary submodule. by
2014-04-18 15:37:21 -0400
a8f918f
Update class names. Move language ids to languages.py by
2014-04-18 15:36:38 -0400
0a57cf5
Add first version of the c parser Add: c.py - CTreeCutter class is very similar to PyTreeCutter. It utilizes self.cache as opposed to PyTreeCutter which doesn't yet. - CTreeCutter visit functions simply add start and end lines of the node to the cache, and visit_Decl pushes the cache onto accum. - parse_c performs a task identical to parse_py. However, many c files need to be pre-processed before they are parsed. by
2014-04-15 13:38:20 -0400
847410b
Minor fix-ups in python parser. Mod: python.py - Add self.cache to allow for saving of unassocaited metadata as the PyTreeCutter moves down the syntax tree. - Update docstrings. by
2014-04-15 13:31:01 -0400
d485b87
Fix docstring in bitshift/parser/python.py by
2014-04-14 19:10:32 -0400
b77db87
Refactor parsing in python by adding node visitor class. Performs same tasks as previous version, but is more concise. Add: bitshift/parser/python.py: Add PyTreeCutter class to perform actions on specific nodes. by
2014-04-14 17:47:48 -0400
4d8c818
Corrected documentation in bitshift/codelet.py and bitshift/parser/__init__.py by
2014-04-14 17:45:53 -0400
5db273a
Bugfixes for _serialize function in bitshift/parser/python.py by
2014-04-14 13:13:20 -0400
0c5e457
Add placeholder functions for parsing c and java in bitshift/parser. Add parse_py function with helper functions. Parse_py grabs relevant information on variables, functions, and classes from abstract syntax tree of codelet code. by
2014-04-14 12:53:31 -0400
903e4cc
Add constants in bitshift/config.py for languages instead of just strings. by
2014-04-14 12:51:42 -0400
efdcb37
Add docstrings for functions in parser. Add ivar for syntax tree to codelet documentation. by
2014-04-11 13:14:59 -0400
d88e68e
Add dispatch 'parse' function to parser __init__.py. Basic code language identification as well. Included pycparser as a depedency. by
2014-04-11 13:03:13 -0400
4dfd297
Update some documentation. by
2014-05-05 10:35:57 -0400
c4816c2
Merge branch 'develop' into feature/query_parser by
2014-05-05 10:28:03 -0400
2cf98df
Merge branch 'develop' of github.com:earwig/bitshift into develop by
2014-05-04 01:21:52 -0400
a3b1f6d
Merge branch 'feature/database' into develop by
2014-05-04 01:18:52 -0400
56f23e6
Database to v6; flesh out a lot of Database.search(). by
2014-05-04 01:18:30 -0400
7c5c9fc
Add GitHub stars, Bitbucket watchers; close #14. by
2014-05-03 22:20:12 -0400
d142f1f
Complete Crawler. Close #15, #14, #11, #8. by
2014-05-03 15:22:29 -0400
6762c1f
Re-add logging, rem file filters. by
2014-05-03 15:06:03 -0400
1b2739f
Add GitHub repo star count, simple logging. by
2014-04-30 15:20:15 -0400
ad7ce9d
Commit latest crawler, continue fix of #8. by
2014-04-29 12:53:49 -0400
f387727
Remove some subprocesses, comment out logging. by
2014-04-19 15:33:21 -0400
2954161
Add partially integrated BitbucketCrawler(). by
2014-04-18 21:31:10 -0400
93ed686
Add partially integrated BitbucketCrawler(). by
2014-04-18 21:31:10 -0400
6718650
First part of #8 fix. by
2014-04-18 12:01:06 -0400
3ce399a
Add threaded cloner, GitRepository class (#7). by
2014-04-17 14:05:12 -0400
755dce6
Add logging to crawler/indexer. by
2014-04-17 09:53:27 -0400
f4b28e6
Add file-ext regex rules, exception handlers. by
2014-04-17 09:05:28 -0400
627c848
Add tested indexer. by
2014-04-16 16:41:14 -0400
b680756
Test crawler, complete documentation. by
2014-04-16 13:32:04 -0400
b7ccec0
Add untested threaded indexer/crawler prototype. by
2014-04-15 11:08:53 -0400
97198ee
Update Crawler documentation. by
2014-04-15 09:40:11 -0400
c655d97
Add class ChangeDir, amend unsafe subprocess. by
2014-04-14 22:09:05 -0400
9fc4598
Clean up crawler/, fix minor bugs. by
2014-04-14 21:21:58 -0400
77b448c
Mod Codelet, mov codelet creation from crawler. by
2014-04-14 18:41:00 -0400
ef9c060
Mov author_files > git_inder, heavily refactor. by
2014-04-14 13:02:59 -0400
ef73c04
Add prototype repo-indexer script author_files.py. by
2014-04-13 21:57:22 -0400
950b699
Database to v5; finish Database.insert(). by
2014-05-03 17:50:16 -0400
d6ccdbd
Fix a couble Database bugs. by
2014-05-02 22:43:16 -0400
d2aef28
Finish database insertion, except for origins. by
2014-05-02 14:40:52 -0400
97b0644
Database to v4: split off symbol_locations table. by
2014-05-02 14:40:00 -0400
e3a8382
Flesh out most of Database.insert(). by
2014-04-30 14:44:45 -0400
821a6ae
DB -> v3 for symbol->code assoc vs. ->codelet (fixes #13) by
2014-04-30 14:44:31 -0400
0b655da
Finish migration to v2. by
2014-04-29 13:19:02 -0400
a5cc353
Credits. by
2014-04-29 12:42:11 -0400
22d6b62
Update schema to v2; database updates. by
2014-04-28 14:05:45 -0400
0d0a74f
Some more work on db stuff. by
2014-04-27 23:43:32 -0400
54bca58
Move database stuff to a subpackage; updates. by
2014-04-27 00:47:13 -0400
ad3de06
Fix some typos in the schema. by
2014-04-24 14:38:33 -0400
fb4e0d5
FULLTEXT KEYs where appropriate. by
2014-04-18 02:16:42 -0400
75b243f
Remove languages table; add indexed field for codelet rank. by
2014-04-17 20:33:14 -0400
1cbe669
More work on db schema; all except FTS indices. by
2014-04-17 19:25:42 -0400
bc3b9e7
Some more database design work. by
2014-04-17 17:33:14 -0400
085fd62
Database schema, hashing module, some other things. by
2014-04-15 00:38:12 -0400
962dd9a
Docstrings for Database methods; oursql dependency. by
2014-04-14 12:02:23 -0400
34e629b
Some early work on varous query objects. by
2014-04-14 10:01:34 -0400
20b518f
Minor refactor of codelet. by
2014-04-11 13:03:03 -0400
6a4ba58
Add Codelet, crawler dependencies to setup. by
2014-04-11 12:43:34 -0400
902d734
Update __init__.py. by
2014-04-11 12:36:55 -0400
b70e2c9
Update assets module with template docstring. by
2014-04-10 13:14:27 -0400
0c68988
CREATE THE THINGS by
2014-04-10 13:10:45 -0400
6a9598f
Basic setup.py. by
2014-04-09 13:10:33 -0400
08249e0
Fix __init__.py and add some info to README. by
2014-04-09 12:59:54 -0400
6adea4a
Adding basic sphinx documentation. by
2014-04-07 21:26:49 -0400
404a2fb
Fix names in license. by
2014-04-08 01:08:13 +0000
82147c7
Fix description. by
2014-03-31 20:32:22 -0400
6ff65c0
Merge branch 'master' into develop by
2014-03-27 23:28:53 -0400
f24d2a6
Add assets/config module, SASS files, templates. by
2014-03-27 21:32:55 -0400
9d06e0c
Add skeleton dir-structure, content to files. by
2014-03-27 19:49:51 -0400
02c5890
app.py by
2014-03-27 13:11:35 -0400
e3b711a
Initial commit by
2014-03-27 09:59:26 -0700