Benjamin Attal
4202552a1e
Remove unecessary import
10 lat temu
Ben Kurtovic
7bac5c0b9a
Update schema to v7; correctly order entries in cache.
10 lat temu
Ben Kurtovic
e0194ab346
Forgot to update query numbers.
10 lat temu
Ben Kurtovic
69b011ac12
Fix.
10 lat temu
Ben Kurtovic
8acddb6834
Fix cache retrieval behavior.
10 lat temu
Benjamin Attal
7337638b72
Change form get_lexer to guess_lexer -- guess_lexer is more accurate.
10 lat temu
Ben Kurtovic
7b9e98bc5b
Thanks @riamse
10 lat temu
Benjamin Attal
ca4bb8ff42
Change format of symbols to fit databse model.
10 lat temu
Ben Kurtovic
b2d9ad5c97
Missed a line.
10 lat temu
Ben Kurtovic
b6fabc54ec
Can't use executemany() here.
10 lat temu
Benjamin Attal
21cf52ea65
Call start_parse_servers from crawl.py
10 lat temu
Benjamin Attal
c9520fd4e3
Add function to start parse servers in parser/__init__.py
10 lat temu
Ben Kurtovic
e64c81f66f
Consistency fix: Use Codelet.url instead of Codelet.code_url.
10 lat temu
Benjamin Attal
65a2688be9
Adjust test for new language dict.
10 lat temu
Ben Kurtovic
f02dc4497c
Fixes.
10 lat temu
Ben Kurtovic
5d8ac664fe
HAHA WHAT ARE ITERATORS
10 lat temu
Benjamin Attal
5e6e3fcbf5
Adjust test for new language dict.
10 lat temu
Severyn Kozak
94953624c8
Fix #34 .
Add:
bitshift/crawler/indexer.py
-Add a `try-except` block to catch the `UnsupportedFileError` exception.
10 lat temu
Ben Kurtovic
5a83720617
Strip encoding lines.
10 lat temu
Severyn Kozak
fc8d478060
Untested fix #33 .
Add:
bitshift/crawler/indexer.py
-Add conditional to remove the full path of a repository if the owner's
directory contains only one sub-directory.
10 lat temu
Ben Kurtovic
a3eacc287e
Try to make exception reporting more useful.
10 lat temu
Ben Kurtovic
5d9ef2774d
Some fixes, mainly involving language detection.
10 lat temu
Ben Kurtovic
9f935bbb74
This is ugly, but it improves the current setup.
10 lat temu
Severyn Kozak
b698a16c98
Add parse() and insert() calls to crawler.
Add:
bitshift/crawler/indexer.py
-Add `parse()` and `insert()` calls to `_insert_repository_codelets()`.
10 lat temu
Severyn Kozak
f8436fa484
Part of #26 . Move __init__.py to crawl.py.
Add:
bitshift/crawler/(__init__, crawl).py
-Move `__init__.py` to `crawl.py`, and add a `main` block to allow
running the crawler via `python -m`.
10 lat temu
Ben Kurtovic
4dc83e913a
Implement /search.json route ( #27 )
10 lat temu
Ben Kurtovic
52c7e9cf5e
Finish JSON stringification ( closes #29 )
10 lat temu
Ben Kurtovic
326be3050f
Start work on serializing; clean up languages.
10 lat temu
Ben Kurtovic
878088f9ab
Split symbols into implicit OR groups ( closes #28 )
10 lat temu
Ben Kurtovic
860260d0ad
Finish Database.search() ( closes #19 )
10 lat temu
Ben Kurtovic
1fbe0e823b
Implement most of _get_codelets_from_ids().
10 lat temu
Ben Kurtovic
827ea09085
Only use dynamic ranks if the conditional is complex.
10 lat temu
Ben Kurtovic
7309ca5532
Some fixes and adjustments; note.
10 lat temu
Ben Kurtovic
ba8bde7689
Cleanup, fixes, additions.
10 lat temu
Ben Kurtovic
fbc5eebad4
Refactor query production; fixes.
10 lat temu
Ben Kurtovic
91256b1384
Finish parameterize() for Text.
10 lat temu
Ben Kurtovic
f49ad1d281
Move parameterization functions to Node classes.
10 lat temu
Ben Kurtovic
8fbfd4c45c
Fix _parse_node; _parse_term quote handling; should probably refactor.
10 lat temu
Ben Kurtovic
08ce46faeb
Implement FTS for authors.
10 lat temu
Ben Kurtovic
d3f6f226f1
Finish exploding symbols.
10 lat temu
Ben Kurtovic
1111afc198
Explode dates and symbols.
10 lat temu
Ben Kurtovic
cd27777f83
Start exploding query trees.
10 lat temu
Ben Kurtovic
4fa8b9f444
Improve exception behavior; cleanup.
10 lat temu
Severyn Kozak
43c6b2ca01
Add languages.json, integrate with languages.py.
Add:
bitshift/resources
-Add directory for bitshift's Python resources.
bitshift/resources/languages.json
-Add `json` file containing all the programming languages supported by
bitshift.
bitshift/languages.py
-Set `LANGS` by reading `languages.json`.
10 lat temu
Ben Kurtovic
7e876c835f
Add sort keys, fix a bug, add tests.
10 lat temu
Ben Kurtovic
30406483b9
Add unit tests, cleanup.
10 lat temu
Ben Kurtovic
983f9a5d94
Implement basic tree balancing.
10 lat temu
Ben Kurtovic
c9b558d0b3
Finish query parsing, mostly.
10 lat temu
Ben Kurtovic
611a4ea9b2
More accurate model for boolean logic.
10 lat temu
Ben Kurtovic
d319bde7db
Begin work on a model for boolean logic parsing.
10 lat temu