- 25 Apr, 2018 1 commit
-
-
Eesaan Atluri authored
By default only 10 results are returned, so we need to specify the size explicitly (10000 here) for all the results to be dumped into the response object. Otherwise you will get only partial results.
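A minimal sketch of the kind of request body this commit describes. The helper name `build_search_body` and the `match_all` query are assumptions for illustration; only the explicit `size` of 10000 comes from the commit message.

```python
import json

def build_search_body(size=10000):
    """Return a match-all search body that asks for up to `size` hits.

    Hypothetical helper: without an explicit "size", Elasticsearch
    returns only the first 10 hits.
    """
    return {"size": size, "query": {"match_all": {}}}

if __name__ == "__main__":
    # The resulting dict would be sent as the body of a search request.
    print(json.dumps(build_search_body()))
```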
-
- 24 Apr, 2018 2 commits
-
-
Eesaan Atluri authored
Printing just the unique subjects and their count.
-
Eesaan Atluri authored
Converting the resulting dictionary elements into a list and counting the number of occurrences of each value is inefficient when you already have a dictionary object to operate on. Dictionaries use hash tables, so looking up a value by its key is efficient, whereas finding a value in a list requires a sequential scan, which makes lookups slow when you have a large number of values.
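The difference can be sketched as follows. The function name `count_subjects` and the `"Subject"` key are assumptions for illustration, not the code from this commit; the point is that each subject is counted with one hash lookup instead of rescanning a list.

```python
def count_subjects(messages):
    """Count how often each subject occurs, using a dict as the accumulator.

    Each update is a single hash lookup, so the whole pass is linear in
    the number of messages. Calling list.count() once per subject would
    rescan the full list every time.
    """
    counts = {}
    for message in messages:
        subject = message["Subject"]  # hypothetical field name
        counts[subject] = counts.get(subject, 0) + 1
    return counts

if __name__ == "__main__":
    sample = [{"Subject": "quota"}, {"Subject": "login"}, {"Subject": "quota"}]
    print(count_subjects(sample))
```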
-
- 23 Apr, 2018 2 commits
-
-
Eesaan Atluri authored
Purpose: to see the unique subjects and their number of occurrences so that we can get a handle on the hot topics.
-
Eesaan Atluri authored
This piece checks whether the Elasticsearch instance is up, connects to it, runs some basic queries, and returns a JSON-formatted result. More useful queries coming up.
-
- 19 Apr, 2018 1 commit
-
-
Eesaan Atluri authored
Merge remote-tracking branch 'louistw/master'
-
- 18 Apr, 2018 3 commits
-
-
Bo-Chun Chen authored
Modified date format to include timezone. See merge request louistw/lsoftarchive2json!7
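One way such a conversion could look, as a sketch only: the target format (ISO 8601 with a UTC offset) and the helper name `to_iso_date` are assumptions, since the commit message does not state the exact format used.

```python
from email.utils import parsedate_to_datetime

def to_iso_date(header_value):
    """Convert an RFC 2822 Date header into ISO 8601, keeping the UTC offset."""
    return parsedate_to_datetime(header_value).isoformat()

if __name__ == "__main__":
    print(to_iso_date("Mon, 16 Apr 2018 10:30:00 -0500"))
```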
-
Krish Moodbidri authored
-
Eesaan Atluri authored
Merge remote-tracking branch 'louistw/master'
-
- 16 Apr, 2018 6 commits
-
-
Bo-Chun Chen authored
Replacing non-breaking spaces with regular spaces is enough to fix the problem that stops the email parser.
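A minimal sketch of that replacement, assuming the text is already decoded to a Python string; the helper name `normalize_spaces` is hypothetical.

```python
NBSP = "\u00a0"  # non-breaking space

def normalize_spaces(text):
    """Replace every non-breaking space with a regular space."""
    return text.replace(NBSP, " ")

if __name__ == "__main__":
    print(normalize_spaces("Subject:\u00a0quota question"))
```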
-
Bo-Chun Chen authored
To extract reply content only, eliminate the previous messages
-
Bo-Chun Chen authored
-
Bo-Chun Chen authored
Since support-watch has more messages in a month, 2 digits are not enough.
-
Bo-Chun Chen authored
Since Elasticsearch field names are case-sensitive
-
Eesaan Atluri authored
-
- 12 Apr, 2018 1 commit
-
-
Eesaan Atluri authored
-
- 10 Apr, 2018 1 commit
-
-
Bo-Chun Chen authored
Using BeautifulSoup to extract content from text/html content type
-
- 09 Apr, 2018 4 commits
-
-
Eesaan Atluri authored
-
Eesaan Atluri authored
Date format has to be changed to match the type accepted for mapping
-
Bo-Chun Chen authored
-
Bo-Chun Chen authored
Changed fields selected to only Date, From, Subject, and Body. Formatted the Date field so that it is accepted by Elasticsearch. Formatted the From field to remove unnecessary double quotes.
-
- 06 Apr, 2018 8 commits
-
-
Eesaan Atluri authored
-
Eesaan Atluri authored
Conflicts: spliter.sh
-
Eesaan Atluri authored
Mac uses {999} and other systems use {*} in the csplit utility.
-
Bo-Chun Chen authored
Made the parser output only if the message is parsed correctly and has content
-
Bo-Chun Chen authored
Changed some syntax to be runnable in Python 3
-
Bo-Chun Chen authored
It will run with option {999} only on Mac, i.e. when uname -s is 'Darwin'
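The platform check can be sketched in Python as below. The actual script is a shell script, so this is an illustration of the logic rather than the committed code, and the helper name `csplit_repeat` is hypothetical. `platform.system()` reports the same name as `uname -s`.

```python
import platform

def csplit_repeat(system=None):
    """Pick the csplit repeat argument for the given (or current) system.

    "{999}" is used on macOS ("Darwin"); "{*}" is used everywhere else.
    """
    if system is None:
        system = platform.system()
    return "{999}" if system == "Darwin" else "{*}"

if __name__ == "__main__":
    print(csplit_repeat())
```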
-
Bo-Chun Chen authored
Script now can take more than one filename as arguments
-
Bo-Chun Chen authored
Added permissions for direct execution of the script
-
- 05 Apr, 2018 8 commits
-
-
Bo-Chun Chen authored
Master See merge request louistw/lsoftarchive2json!5
-
Bo-Chun Chen authored
Update spliter.sh See merge request louistw/lsoftarchive2json!4
-
Eesaan Atluri authored
Removed the line responsible for adding the metadata to the raw JSON. It is needed if we load JSON manually into Elasticsearch, but jsonpyes takes care of it implicitly. The raw JSON we get from the parser is enough to be used as input to the jsonpyes tool.
-
Krish Moodbidri authored
Input is no longer restricted to .txt files.
-
Eesaan Atluri authored
-
Bo-Chun Chen authored
Fixed script path and made executable See merge request louistw/lsoftarchive2json!2
-
John-Paul Robinson authored
Added env support for finding python and permissions for direct execution of the script.
-
Bo-Chun Chen authored
-
- 04 Apr, 2018 3 commits
-
-
Eesaan Atluri authored
Conflicts: parser.py
-
Bo-Chun Chen authored
Only continue working on extracting data when the parser parses successfully
-
Bo-Chun Chen authored
Check both content type and filename for the body, since only attached files have a filename and the email body does not.
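A sketch of that check using the standard library `email` package. The function name `extract_body` and the choice of `text/plain` as the body type are assumptions; the committed parser may differ.

```python
from email.message import EmailMessage

def extract_body(msg):
    """Return the first text/plain part that has no filename, or None.

    Attachments carry a filename, so a text/plain part with a filename
    is treated as an attached file rather than the message body.
    """
    for part in msg.walk():
        if part.get_content_type() == "text/plain" and part.get_filename() is None:
            charset = part.get_content_charset() or "utf-8"
            return part.get_payload(decode=True).decode(charset)
    return None

if __name__ == "__main__":
    message = EmailMessage()
    message.set_content("Hello list")
    # A text attachment has the same content type as the body but a filename.
    message.add_attachment("attached notes", filename="notes.txt")
    print(extract_body(message))
```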
-