You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+41-11Lines changed: 41 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,8 @@ Elasticsearch For Beginners: Indexing your Gmail Inbox
2
2
=======================
3
3
4
4
5
-
#### What's this all about?
5
+
6
+
#### What's this all about?
6
7
7
8
I recently looked at my Gmail inbox and noticed that I have well over 50k emails, taking up about 12GB of space but there is no good way to tell what emails take up space, who sent them to, who emails me, etc
8
9
@@ -15,11 +16,11 @@ __Related tutorial:__ [Index and Search Hacker News using Elasticsearch and the
15
16
16
17
Set up [Elasticsearch](http://ohardt.us/es-install) and make sure it's running at [http://localhost:9200](http://localhost:9200)
17
18
18
-
I use Python and [Tornado](https://github.com/tornadoweb/tornado/) for the scripts to import and query the data. Run `pip install tornado` to install Tornado.
19
+
I use Python and [Tornado](https://github.com/tornadoweb/tornado/) for the scripts to import and query the data. Run `pip install tornado chardet` to install Tornado and chardet.
19
20
20
21
21
22
22
-
#### Aight, where do we start?
23
+
#### Aight, where do we start?
23
24
24
25
First, go [here](http://ohardt.us/download-gmail-mailbox) and download your Gmail mailbox, depending on the amount of emails you have accumulated this might take a while.
25
26
@@ -100,7 +101,7 @@ for part in parts:
100
101
101
102
##### Index the data with Elasticsearch
102
103
103
-
The most simple aproach is a PUT request per item:
104
+
The most simple approach is a PUT request per item:
0 commit comments