summaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAge
* Convert notmuch_thread_ids_t to notmuch_terms_tCarl Worth2009-10-25
| | | | | | Aside from increased code sharing, the benefit here is that now thread_ids iterates over the terms of a message rather than the thread_id value. So we'll now be able to drop that value.
* Implement notmuch_tags_t on top of new notmuch_terms_tCarl Worth2009-10-25
| | | | | | | The generic notmuch_terms_t iterator should provide support for notmuch_thread_ids_t when we switch as well, (And it would be interesting to see if we could reasonably make this support a PostingIterator too. Time will tell.)
* Shuffle the value numbers around in the database.Carl Worth2009-10-24
| | | | | | | | | | | | | | | | | | | | First, it's nice that for now we don't have any users yet, so we can make incompatible changes to the database layout like this without causing trouble. ;-) There are a few reasons for this change. First, we now use value 0 uniformly as a timestamp for both mail and timestamp documents, (which lets us cleanup an ugly and fragile bare 0 in the add_value and get_value calls in the timestamp code). Second, I want to drop the thread value entirely, so putting it at the end of the list means we can drop it as compatible change in the future. (I almost want to drop the message-ID value too, but it's nice to be able to sort on it to get diff-able output from "notmuch dump".) But the thread value we never use as a value, (we would never sort on it, for example). And it's totally redundant with the thread terms we store already. So expect it to disappear soon.
* Invent our own prefix values.Carl Worth2009-10-24
| | | | | | | | | | | We're now dropping all pretense of keeping the database directly compatible with sup's current xapian backend. (But perhaps someone might write a new nothmuch backend for sup in the future.) In coming up with the prefix values here, I tried to follow the conventions of http://xapian.org/docs/omega/termprefixes.html as closely as makes sense, (with some domain translation from "web" to "email archive").
* Split BOOLEAN_PREFIX into INTERNAL and EXTERNAL subsets.Carl Worth2009-10-24
| | | | | | | | | | | | | The idea here is that only some of the prefix names (such as "id" and "tag") actually make sense in external user-supplied query strings. Other things like "type" are internal implementation details of how we store things in the database. So internal machinery will add those terms to the database and we don't need to support them in the string itself. With this, we can now simply loop over the external prefix values to let the quiery parser know about them. So as we add prefixes in the future, we'll only need to add them to this list.
* Change all occurrences of "msgid" to "id".Carl Worth2009-10-24
| | | | What's good for the user is good for the internals.
* Add bash-completion script for notmuch.Carl Worth2009-10-24
| | | | | | It's not much of a script, (we don't have that many commands after all), but it's the kind of thing that's nice to have and gives the tool a slightly more polished feel.
* Add the magic to allow searches such as "tag:inbox".Carl Worth2009-10-24
| | | | | | | | | | | The key for this is call add_boolean_prefix on the QueryParser object. That tells the query parser to take something like "tag:inbox" and transform it into the "Linbox" term and do what it needs to do to make this term a requirement of the search. We're starting to have a real system here. Also, I didn't want to expose the ugly name of "msgid" to the user, so we add a prefix name of simply "id" instead.
* Use _find_prefix instead of hard-coded term in notmuch_query_searchCarl Worth2009-10-24
| | | | | | I'm planning to change prefix values soon, which would break code like this. So eliminate the fragility by going through our existing _find_prefix function.
* Fix bit-twiddling brain damage in notmuch_query_searchCarl Worth2009-10-24
| | | | | | | | | Here's the big bug that was preventing any searches from working at all like desired. I did the work to carefully pick out exactly the flags that I wanted, and then I threw it away by trying to combine them with & instead of | (so just passing 0 for flags instead). Much better now.
* Add debugging code for examining query strings.Carl Worth2009-10-24
| | | | | | | | | | | It's nice that Xapian provides a little function to print a textual representation of the entire query tree. So now, if you compile like so: make CFLAGS=-DDEBUG_QUERY then you get a nice output of the query string received by the query module, and the final query actually being sent to Xapian.
* Add a preliminary "notmuch search" command.Carl Worth2009-10-24
| | | | | | | | | | | | | This isn't behaving at all like it's documented yet, (for example, it's returning message IDs not thread IDs[*]). In fact, the output code is just a copy of the body of "notmuch dump", so all you get for now is message ID and tags. But this should at least be enough to start exercising the query functionality, (which is currently very buggy). [*] I'll want to convert the databse to store thread documents before fixing that.
* notmuch_database_create: Document idea to (optionally) return a statusCarl Worth2009-10-24
| | | | | | | The current problem is that when this function fails the caller doesn't get any information about what the particular failure was, (something in the filesystem? or in Xapian?). We should fix that.
* notmuch setup/new: Propagate failure from notmuch_database_set_timestampCarl Worth2009-10-24
| | | | | | | | | | With some recent testing, the timestamp was failing, (overflowing the term limit), and reporting an error, but the top-level notmuch command was still returning a success return value. I think it's high time to add a test suite, (and the code base is small enough that if we add it now it shouldn't be *too* hard to shoot for a very high coverage percentage).
* Fix timestamp generation to avoid overflowing the term limitCarl Worth2009-10-24
| | | | | | The previous code was only correct as long as the timestamp prefix was only a single character. But with the recent change to a multi-character prefix, this broke. So fix it now.
* Trim down prefix list to things we are actually using.Carl Worth2009-10-24
| | | | | | | | | | | | | | | | | | | I've decided not to try for sup compatibility at the leve of the xapian datbase. There's just too much about sup's usage of the database that I don't like, (beyond the embedded ruby data structures there is redundant storage of message IDs, thread IDs, and dates (in both terms and values)). I'm going to fix that up in the database of notmuch, with some other changes as well. (I plan to drop "reference" terms once linkage to a thread ID through the reference is established. I also plan to add actual documents to represent threads.) So with all that incompatibility, I might as well make my own prefix values. And while doing that, I should try to be as compatible as possible with the conventions described here: http://xapian.org/docs/omega/termprefixes.html
* Move the prefix-string arrays back into database.cc from message.ccCarl Worth2009-10-24
| | | | | Yes, I'm being wishy-washy here, moving code back and forth. But this is where these really do belong.
* Revert "Remove some unneeded initializers."Carl Worth2009-10-24
| | | | | | | This reverts commit fb1bae07002d45138832eacb280419dbd7a19774. These initializers were totally necessary. I clearly wasn't thinking straight when I removed them.
* Cut the enthusiasm a bit.Carl Worth2009-10-23
| | | | It gets annoying pretty quick.
* Make "notmuch new" ignore directories that are read-only.Carl Worth2009-10-23
| | | | | | With this, "notmuch new" is now plenty fast even with large archives spanning many sub-directories. Document this both in "notmuch help" and also in the output of notmuch setup.
* add_files: Pull one stat out of the recrusive function.Carl Worth2009-10-23
| | | | | There's no need to stat each directory both before and after each recursive call.
* More fixing of plurals.Carl Worth2009-10-23
| | | | | It definitely doesn't help that we have the same messages in both "setup" and "new". Should combine those really.
* More care in final status reporting.Carl Worth2009-10-23
| | | | | Printing "Added 1 new messages" just looks like lack of attention to detail, (but yes plurals can be annoying this way).
* Print a better message than "0s" for zero seconds.Carl Worth2009-10-23
| | | | It's nice to have a tool that at least construct actual sentences.
* Add new "notmuch new" command.Carl Worth2009-10-23
| | | | | | | | Finally, I can get new messages into my notmuch database without having to run a complete "notmuch setup" again. This takes advantage of the recent timestamp capabilities in the database to avoid looking into directories that haven't changed since the last time "notmuch new" was run.
* add_files: Change to return a status value instead of voidCarl Worth2009-10-23
| | | | | Also change to use goto rather than early returns. And once again, there were lots of bugs in the error cases previously.
* notmuch setup: Clean up the progress printing a bit.Carl Worth2009-10-23
| | | | | | | | | | Get rid of a useless leading 0 on the seconds value, and make a distinction between "files" and "messages", (we process many files, but not all of them are recongized as messages). Finally, add a summary line at the end saying how many unique messages were added to the database. Since this comes right after the total number of files, it gives the user at least a hint as to how many messages were encountered with duplicate message IDs.
* Re-order documentation a bit.Carl Worth2009-10-23
| | | | | | The notmuch_database_get_default_path function is unique in not accepting a notmuch_database_t* (nor creating one). So list it outside the other notmuch_database functions.
* notmuch_message_get_filename: Improve documentation.Carl Worth2009-10-23
| | | | | Fix a typo, and add clarifications about the lifetime and readonly nature of the return value.
* Remove some unneeded initializers.Carl Worth2009-10-23
| | | | | | | | | | Some people might argue for more initializers to be "safer", but I actually prefer to leave things this way. It saves typing, but the real benefit is that the things that do require initialization stand out so we know to watch them carefully. And with valgrind, we actually get to catch errors earlier if we *don't* initialize them. So that can be "safer" ironically enough.
* notmuch setup: Fix a couple of error paths.Carl Worth2009-10-23
| | | | | We had early returns instead of goto statments, and sure enough, they were leaking. Much cleaner this way.
* _find_prefix: Exit when given an invalid prefix name.Carl Worth2009-10-23
| | | | This will be a nice safety check for internal sanity.
* Add NOTMUCH_STATUS_DUPLICATE_MESSAGE_IDCarl Worth2009-10-23
| | | | | | | | | And document that notmuch_database_add_message can return this value. This pushes the hard decision of what to do with duplicate messages out to the user, but that's OK. (We weren't really doing anything with these ourselves, and this way the user is at least informed of the issue, rather than it just getting papered over internally.)
* Clean up comments to not include spaces before tabs.Carl Worth2009-10-23
| | | | | | This were just unclean, (an invisble sort of uncleanliness, but still there are liable to make for ugly diffs). Oh, wait, like this one! But at least it's not sprinkled among code changes.
* Clarify documentation and error string for NOTMUCH_STATUS_TAG_TOO_LONGCarl Worth2009-10-23
| | | | It's helpful to point out NOTMUCH_STATUS_TAG_MAX for users.
* Add notmuch_database_set_timestamp and notmuch_database_get_timestampCarl Worth2009-10-23
| | | | | These will be very helpful to implement an efficient "notmuch new" command which imports new mail messages that have appeared.
* database: Add private find_unique_doc_id and find_unique_document functionsCarl Worth2009-10-23
| | | | | | These are a generalization of the unique-ness testing of notmuch_database_find_message. More preparation for firectory timestamps.
* database: Similarly rename find_message_by_docid to find_document_for_doc_idCarl Worth2009-10-23
| | | | | | | | Again preferring notmuch_database_t* over Xapian::Database*. Also, we're standardizing on "doc_id" rather than "docid" locally, (as an analoge to "message_id"), in spite of the "Xapian::docid" name, (which, fortunately, we can ignore and just us "unsigned int" instead).
* database: Rename internal find_messages_by_term to find_doc_idsCarl Worth2009-10-23
| | | | | | | | | | This name is a more accurate description of what it does, and the more general naming will make sense as we start storing non-message documents in the database (such as directory timestamps). Also, don't pass around a Xapian::Database where it's more our style to pass a notmuch_database_t*.
* sha1: Add new notmuch_sha1_of_string functionCarl Worth2009-10-23
| | | | | | | | | | | | | | | We'll be using this for storing really long terms in the database and when we just need to look them up, (and never read back the original data directly from the database). For example, storing arbitrarily long directory paths in the database along with mtime timestamps. Note that if we did want to store arbitrarily long terms and also be able to read them back, the Xapian folks recommending splitting the term off with multiple prefixes. See the note near the end of this page: http://trac.xapian.org/wiki/FAQ/UniqueIds
* notmuch restore: Print names of tags that cannot be appliedCarl Worth2009-10-23
| | | | | | | | | | | | | | | | This helps the user gauge the severity of the error. For example, when restoring my sup tags I see a bunch of tags missing for message IDs of the form "sup-faked-...". That's not surprising since I know that sup generates these with the md5sum of the message header while notmuch uses the sha-1 of the entire message. But how much will this hurt? Well, now that I can see that most of the missing tags are just "attachment", then I'm not concerned, (I'll be automatically creating that tag in the future based on the message contents). But if a missing tag is "inbox" then that's more concerning because that's data that I can't easily regenerate outside of sup.
* notmuch_tags_has_more: Fix to use string.empty rather than string.sizeCarl Worth2009-10-23
| | | | | I'm really interested in the length of the data here, not the size of the storage.
* Fix notmuch_message_get_message_id to never return NULL.Carl Worth2009-10-23
| | | | | | With the recent improvements to the handling of message IDs we "know" that a NULL message ID is impossible, (so we simply abort if the impossible happens).
* add_message: Fix to not add multiple documents with the same message IDCarl Worth2009-10-23
| | | | | | | | Here's the second big fix to message-ID handling, (the first was to generate message IDs when an email contained none). Now, with no document missing a message ID, and no two documents having the same message ID, we have a nice consistent database where the message ID can be used as a unique key.
* Add _notmuch_message_create_for_message_idCarl Worth2009-10-23
| | | | | | | This is the last piece needed for add_message to be able to properly support a message with a duplicate message ID. This function creates a new notmuch_message_t object but one that may reference an existing document in the database.
* Fix _notmuch_message_create to catch Xapian DocNotFoundError.Carl Worth2009-10-23
| | | | | | | | This function is only supposed to be called with a doc_id that was queried from the database already. So there's an internal error if no document with that doc_id can be found in the database. In that case, return NULL.
* Add internal functions for manipulating a new notmuch_message_tCarl Worth2009-10-23
| | | | | | | | | | | This will support the add_message function in incrementally creating state in a new notmuch_message_t. The new functions are _notmuch_message_set_filename _notmuch_message_add_thread_id _notmuch_message_ensure_thread_id _notmuch_message_set_date _notmuch_message_sync
* Add notmuch_message_get_filenameCarl Worth2009-10-23
| | | | | | | | This is a new public function to find the filename of the original email message for a message-object that was found in the database. We may change this function in the future to support returning a list of filenames, (for messages with duplicate message IDs).
* add_message: Re-order the code a bit (find message-id first).Carl Worth2009-10-23
| | | | | | | | We're preparing for being able to deal with files with duplicate message IDs here. The plan is to create a notmuch_message_t object in add_message that may or may not reference a document that exists in the database. So to do this, we have to find the message ID before we do any manipulation of the doc.
* Move thread_id generation code from database.cc to message.ccCarl Worth2009-10-23
| | | | It's really up to the message to decide how to generate these.