bitshift

Commit graph

Author	SHA1	Message	Date
Ben Kurtovic	816d003dd4	More work on query parsing.	10 years ago
Ben Kurtovic	7b54df6335	Merge branch 'feature/parser' into feature/query_parser	10 years ago
Ben Kurtovic	b5c22d3b4a	More work.	10 years ago
Ben Kurtovic	cf2b48e217	More work on query tree structure.	10 years ago
Ben Kurtovic	674f227b22	Work more on query tree structure.	10 years ago
Benjamin Attal	be7c871cc9	Add Rakefile task for running ruby parse server.	10 years ago
Benjamin Attal	d127ac94ad	1) All unavailable line numbers and column numbers become -1. 2) Add correct dependency in pom.xml	10 years ago
Benjamin Attal	044a448602	Change the format of the symbols to fit with earwig's issue.	10 years ago
Benjamin Attal	4cc0626a71	Catch ClassNotFound error in parser __init__.py	10 years ago
Benjamin Attal	d8048a74f0	Fix data length sent to client from ruby server. Pad with extra bytes.	10 years ago
Benjamin Attal	b16bc40d3f	Consolidate parsers into __init__.py. Update python.py parser.	10 years ago
Benjamin Attal	71dec1d269	Modify the python parser. Make data more descriptive by adding data about function calls.	10 years ago
Benjamin Attal	6e54eb5147	Java server tells python client how much data to read.	10 years ago
Benjamin Attal	d8b234f462	Update docstrings and parser dispatching in parser init file.	10 years ago
Benjamin Attal	7f1d9dd2d3	Add a working preliminary version of the ruby parser. Still need to add a rule for running it in the Rakefile. Add: parser_server.rb: - listens for connections from the python client process parser.rb: - creates a syntax tree from the input and returns relevant data about it to the client	10 years ago
Benjamin Attal	08f16074fb	Add template for ruby parser	10 years ago
Benjamin Attal	c859416d2d	Change test file to support different parsers	10 years ago
Benjamin Attal	2d7c1f4768	Fix array out of bounds exception coming from JavaParser.java	10 years ago
Benjamin Attal	64ef9b04f2	Remove unnecessary imports	10 years ago
Benjamin Attal	f451e426e0	Refactor of the Java Parser Mod: Parser.java: - Moved client reading and writing methods to the abstract parser class, so that it is not specific to the JavaParser JavaParser.java: - Implemented NodeVisitor._cache. The cache is a stack of data packets. When a node that we want information on is first visited, a new packet of data is pushed onto the stack. The child nodes of that original node then add information to the packet, and when the original node is traversed again on the way up the tree, the data is popped from the cache and added to the symbols. This makes it possible to gather information about various levels of the tree easily. JavaSymbols.java: - Refactor all the insertMethods to simply add a packet of data to the appropriate HashMap. Symbols.java - Add a createCoord method which returns an arraylist representing a point in a document.	10 years ago
Benjamin Attal	2338887a52	Working version of java parser up and running.	10 years ago
Benjamin Attal	19a5457f07	Change directory structure for java	10 years ago
Benjamin Attal	306875dae7	Make Parser implement runnable so parsing tasks can be started in separate threads. Make Parser constructor accept a client socket, add reading and writing methods for the socket to JavaParser. Parse main method sets up a server for accepting parse jobs from the crawler, and starts threads for each parse task.	10 years ago
Benjamin Attal	77e2b6f524	Fix errors in java parser, mostly casting issues. In Parse.java, set up a tcp server for communication with python processes. Builds with maven	10 years ago
Benjamin Attal	669c30cac7	Mod: Parse.java: Added comments JavaParser.java: Updated the genSymbols method and a private class 'NodeVisitor' which implements ASTVisitor. genSymbols returns an instance of the Symbols class containing all relevant data about the Java code. JavaSymbols.java: Add fields which map class, interface, method, field, and variable names to positions.	10 years ago
Benjamin Attal	63b09caa6c	Changed directory structure of java parser. Decided on multiple parsers in different languages, refactored bitshift/parser to fit with that paradigm.	10 years ago
Benjamin Attal	a1066dd093	Modify parser/__init__.py so that it communicates with the Java parsing process and reads a result back from a unique file. Add template files for Java parsers.	10 years ago
Benjamin Attal	3bc748242d	Refactor parser/__init__.py for new parsing mechanism	10 years ago
Benjamin Attal	430b7d3588	Remove unnecessary submodule.	10 years ago
Benjamin Attal	a8f918f7c4	Update class names. Move language ids to languages.py	10 years ago
Benjamin Attal	0a57cf50e6	Add first version of the c parser Add: c.py - CTreeCutter class is very similar to PyTreeCutter. It utilizes self.cache as opposed to PyTreeCutter which doesn't yet. - CTreeCutter visit functions simply add start and end lines of the node to the cache, and visit_Decl pushes the cache onto accum. - parse_c performs a task identical to parse_py. However, many c files need to be pre-processed before they are parsed.	10 years ago
Benjamin Attal	847410b13c	Minor fix-ups in python parser. Mod: python.py - Add self.cache to allow for saving of unassociated metadata as the PyTreeCutter moves down the syntax tree. - Update docstrings.	10 years ago
Benjamin Attal	d485b87f21	Fix docstring in bitshift/parser/python.py	10 years ago
Benjamin Attal	b77db873c1	Refactor parsing in python by adding node visitor class. Performs same tasks as previous version, but is more concise. Add: bitshift/parser/python.py: Add PyTreeCutter class to perform actions on specific nodes.	10 years ago
Benjamin Attal	4d8c818c05	Corrected documentation in bitshift/codelet.py and bitshift/parser/__init__.py	10 years ago
Benjamin Attal	5db273a773	Bugfixes for _serialize function in bitshift/parser/python.py	10 years ago
Benjamin Attal	0c5e4572f8	Add placeholder functions for parsing c and java in bitshift/parser. Add parse_py function with helper functions. Parse_py grabs relevant information on variables, functions, and classes from abstract syntax tree of codelet code.	10 years ago
Benjamin Attal	903e4ccc05	Add constants in bitshift/config.py for languages instead of just strings.	10 years ago
Benjamin Attal	efdcb3793a	Add docstrings for functions in parser. Add ivar for syntax tree to codelet documentation.	10 years ago
Benjamin Attal	d88e68e16e	Add dispatch 'parse' function to parser __init__.py. Basic code language identification as well. Included pycparser as a dependency.	10 years ago
Ben Kurtovic	4dfd297472	Update some documentation.	10 years ago
Ben Kurtovic	c4816c2bb8	Merge branch 'develop' into feature/query_parser	10 years ago
Ben Kurtovic	2cf98df3e2	Merge branch 'develop' of github.com:earwig/bitshift into develop Conflicts: app.py setup.py	10 years ago
Ben Kurtovic	a3b1f6d0c3	Merge branch 'feature/database' into develop	10 years ago
Ben Kurtovic	56f23e682a	Database to v6; flesh out a lot of Database.search().	10 years ago
Severyn Kozak	7c5c9fc7e1	Add GitHub stars, Bitbucket watchers; close #14. Add: bitshift/crawler/crawler.py -Add more efficient method of querying GitHub's API for stargazer counts, by batching 25 repositories per request. -Add watcher counts for Bitbucket repositories, by querying the Bitbucket API once per repository (inefficient, but the API in question isn't sufficiently robust to accommodate a better approach, and Git repositories surface so infrequently that there shouldn't be any query limit problems).	10 years ago
Severyn Kozak	d142f1fd55	Complete Crawler. Close #15, #14, #11, #8. Several of the closed issues were addressed partly in previous commits; definitively close them with this, for the moment, final update to the crawler package. Ref: bitshift/crawler/indexer.py -move all `GitIndexer` specific functions (eg, `_decode`, `_is_ascii()`) from the global scope to the class definition.	10 years ago
Severyn Kozak	6762c1fa3d	Re-add logging, rem file filters. Add: bitshift/ __init__.py -add `_configure_logging()`, which sets up a more robust logging infrastructure than was previously used: log files are rotated once per hour, and have some additional formatting rules. (crawler, indexer).py -add hierarchically-descending loggers to individual threaded classes (`GitHubCrawler`, `GitIndexer`, etc.); add logging calls. indexer.py -remove file filtering regex matches from `_get_tracked_files()`, as non-code files will be discarded by the parsers.	10 years ago
Severyn Kozak	1b2739f8c4	Add GitHub repo star count, simple logging. Add: bitshift/crawler/crawler.py -add `_get_repo_stars()` to `GitHubCrawler`, which queries the GitHub API for the number of stars that a given repository has. -log the `next_api_url` every time it's generated by `GitHubCrawler` and `BitbucketCrawler` to two respective log-files.	10 years ago
Severyn Kozak	ad7ce9d9cf	Commit latest crawler, continue fix of #8. Add: bitshift/crawler/*.py -Remove use of the `logging` module, which appeared to be causing a memory leak even with log-file rotation.	10 years ago

100 commits (816d003dd4a2a982c78c37bf52245726853db34a), all branches