summaryrefslogtreecommitdiff
path: root/lib/index.cc
Commit message (Collapse)AuthorAge
* Do not segfault on empty mime partsmartin f. krafft2010-04-13
| | | | | | | | | | | | | | | | | notmuch previously unconditionally checked mime parts for various properties, but not for NULL, which is the case if libgmime encounters an empty mime part. Upon encounter of an empty mime part, the following is printed to stderr (the second line due to my patch): (process:17197): gmime-CRITICAL **: g_mime_message_get_mime_part: assertion `GMIME_IS_MESSAGE (message)' failed Warning: Not indexing empty mime part. This is probably a bug that should get addressed in libgmime, but for not, my patch is an acceptable workaround. Signed-off-by: martin f. krafft <madduck@madduck.net>
* Eliminate some useless gobject boilerplate.Carl Worth2010-02-04
| | | | | | If we had external users of this filter then they might expect some of these macros to exist. But since this is just internal, that's just unneeded noise.
* notmuch new: Don't index uuencoded data.Carl Worth2010-02-04
| | | | | | | | | | | | | | With modern MIME attachments, we're already avoiding indexing the attachments. But for old-school uuencoded data in the mail, we have been directly indexing the encoded data as terms, (which is not useful at all---nobody will ever ytry to search based on the seemingly random uuencoded data). Additionally, indexing a modestly large uuencoded file seems to make Xapian go insane, (consuming *lots* of memory). We fix both problems by detecting uuencoded content and not performing any indexing of it.
* Index content from citations and signatures.Carl Worth2010-01-06
| | | | | | | | In the presentation we often omit citations and signatures, but this is not content that should be omitted from the index, (especially when the citation detection is wrong---see cases where a line beginning with "From" is corrupted to ">From" by mail processing tools).
* database: Store mail filename as a new 'direntry' term, not as 'data'.Carl Worth2010-01-06
| | | | | | | | | | Instead of storing the complete message filename in the data portion of a mail document we now store a 'direntry' term that contains the document ID of a directory document and also the basename of the message filename within that directory. This will allow us to easily store multple filenames for a single message, and will also allow us to find mail documents for files that previously existed in a directory but that have since been deleted.
* lib/index: Fix memory leak for email addresses without names.Carl Worth2009-12-01
| | | | | | | | We carefully noted the fact that we had locally allocated the string here, but then we neglected to free it. Switch to talloc instead which makes it easier to get the behavior we want. It's simpler since we can just call talloc_free unconditionally, without having to track the state of whether we allocated the storage for name or not.
* TypsosIngmar Vanhassel2009-11-18
|
* Don't create "contact" terms in the database.Carl Worth2009-11-12
| | | | | | | | | | | We never did export any interface to get at these, and when I went to use these, I found them inadequate, (because I wanted to distinguish address found in from: from those found in To:). Meanwhile, it was easy enough to extract addresses with a search like: notmuch show tag:sent | grep ^To: so the storage of contact terms was just wasting space. Stop that.
* libify: Move library sources down into lib directory.Carl Worth2009-11-09
A "make" invocation still works from the top-level, but not from down inside the lib directory yet.