whatsnew-2.0.txt - platform/external/libevent - Git at Google

 What's New In Libevent 2.0 so far:

 1. Meta-issues

 1.1. About this document

   This document describes the key differences between Libevent 1.4 and
   Libevent 2.0, from a user's point of view.  It was most recently
   updated based on features in git master as of August 2010.

   NOTE: I am very sure that I missed some thing on this list.  Caveat
   haxxor.

 1.2. Better documentation

   There is now a book-in-progress that explains how to use Libevent and its
   growing pile of APIs.  As of this writing, it covers everything except the
   http and rpc code.  Check out the latest draft at
   http://www.wangafu.net/~nickm/libevent-book/ .

 2. New and Improved Event APIs

   Many APIs are improved, refactored, or deprecated in Libevent 2.0.

   COMPATIBILITY:

   Nearly all existing code that worked with Libevent 1.4 should still
   work correctly with Libevent 2.0.  However, if you are writing new code,
   or if you want to port old code, we strongly recommend using the new APIs
   and avoiding deprecated APIs as much as possible.

   Binaries linked against Libevent 1.4 will need to be recompiled to link
   against Libevent 2.0.  This is nothing new; we have never been good at
   preserving binary compatibility between releases.  We'll try harder in the
   future, though: see 2.1 below.

 2.1. New header layout for improved forward-compatibility

   Libevent 2.0 has a new header layout to make it easier for programmers to
   write good, well-supported libevent code.  The new headers are divided
   into three types.

   There are *regular headers*, like event2/event.h.  These headers contain
   the functions that most programmers will want to use.

   There are *backward compatibility headers*, like event2/event_compat.h.
   These headers contain declarations for deprecated functions from older
   versions of Libevent.  Documentation in these headers should suggest what's
   wrong with the old functions, and what functions you want to start using
   instead of the old ones.  Some of these functions might be removed in a
   future release.  New programs should generally not include these headers.

   Finally, there are *structure headers*, like event2/event_struct.h.
   These headers contain definitions of some structures that Libevent has
   historically exposed.  Exposing them caused problems in the past,
   since programs that were compiled to work with one version of Libevent
   would often stop working with another version that changed the size or
   layout of some object.  We've moving them into separate headers so
   that programmers can know that their code is not depending on any
   unstable aspect of the Libvent ABI.  New programs should generally not
   include these headers unless they really know what they are doing, are
   willing to rebuild their software whenever they want to link it
   against a new version of Libevent, and are willing to risk their code
   breaking if and when data structures change.

   Functionality that once was located in event.h is now more subdivided.
   The core event logic is now in event2/event.h.  The "evbuffer" functions
   for low-level buffer manipulation are in event2/buffer.h.  The
   "bufferevent" functions for higher-level buffered IO are in
   event2/bufferevent.h.

   COMPATIBILITY:

   All of the old headers (event.h, evdns.h, evhttp.h, evrpc.h, and
   evutil.h) will continue to work by including the corresponding new
   headers.  Old code should not be broken by this change.

 2.2. New thread-safe, binary-compatible, harder-to-mess-up APIs

   Some aspects of the historical Libevent API have encouraged
   non-threadsafe code, or forced code built against one version of Libevent
   to no longer build with another.  The problems with now-deprecated APIs
   fell into two categories:

      1) Dependence on the "current" event_base.  In an application with
         multiple event_bases, Libevent previously had a notion of the
         "current" event_base.  New events were linked to this base, and
         the caller needed to explicitly reattach them to another base.
         This was horribly error-prone.

         Functions like "event_set" that worked with the "current" event_base
         are now deprecated but still available (see 2.1).  There are new
         functions like "event_assign" that take an explicit event_base
         argument when setting up a structure.  Using these functions will help
         prevent errors in your applications, and to be more threadsafe.

      2) Structure dependence.  Applications needed to allocate 'struct
         event' themselves, since there was no function in Libevent to do it
         for them.  But since the size and contents of struct event can
         change between libevent versions, this created binary-compatibility
         nightmares.  All structures of this kind are now isolated in
         _struct.h header (see 2.1), and there are new allocate-and-
         initialize functions you can use instead of the old initialize-only
         functions.  For example, instead of malloc and event_set, you
         can use event_new().

         (For people who do really want to allocate a struct event on the
         stack, or put one inside another structure, you can still use
         event2/event_compat.h.)

    So in the case where old code would look like this:

       #include <event.h>
       ...
       struct event *ev = malloc(sizeof(struct event));
       /* This call will cause a buffer overrun if you compile with one version
          of Libevent and link dynamically against another. */
       event_set(ev, fd, EV_READ, cb, NULL);
       /* If you forget this call, your code will break in hard-to-diagnose
          ways in the presence of multiple event bases. */
       event_set_base(ev, base);

    New code will look more like this:

      #include <event2/event.h>
      ...
      struct event *ev;
      ev = event_new(base, fd, EV_READ, cb, NULL);

 2.3. Overrideable allocation functions

   If you want to override the allocation functions used by libevent
   (for example, to use a specialized allocator, or debug memory
   issues, or so on), you can replace them by calling
   event_set_mem_functions.  It takes replacements for malloc(),
   free(), and realloc().

   If you're going to use this facility, you need to call it _before_
   Libevent does any memory allocation; otherwise, Libevent may allocate some
   memory with malloc(), and free it with the free() function you provide.

   You can disable this feature when you are building Libevent by passing
   the --disable-malloc-replacement argument to configure.

 2.4. Configurable event_base creation

   Older versions of Libevent would always got the fastest backend
   available, unless you reconfigured their behavior with the environment
   variables EVENT_NOSELECT, EVENT_NOPOLL, and so forth.  This was annoying
   to programmers who wanted to pick a backend explicitly without messing
   with the environment.

   Also, despite our best efforts, not every backend supports every
   operation we might like.  Some features (like edge-triggered events, or
   working with non-socket file descriptors) only work with some operating
   systems' fast backends.  Previously, programmers who cared about this
   needed to know which backends supported what.  This tended to get quite
   ungainly.

   There is now an API to choose backends, either by name or by feature.
   Here is an example:

       struct event_config_t *config;
       struct event_base *base;

       /* Create a new configuration object. */
       config = event_config_new();
       /* We don't want to use the "select" method. */
       event_config_avoid_method(config, "select");
       /* We want a method that can work with non-socket file descriptors */
       event_config_require_features(config, EV_FEATURE_FDS);

       base = event_base_new_with_config(config);
       if (!base) {
          /* There is no backend method that does what we want. */
          exit(1);
       }
       event_config_free(config);

   Supported features are documented in event2/event.h

 2.5. Socket is now an abstract type

   All APIs that formerly accepted int as a socket type now accept
   "evutil_socket_t".  On Unix, this is just an alias for "int" as
   before.  On Windows, however, it's an alias for SOCKET, which can
   be wider than int on 64-bit platforms.

 2.6. Timeouts and persistent events work together.

   Previously, it wasn't useful to set a timeout on a persistent event:
   the timeout would trigger once, and never again.  This is not what
   applications tend to want.  Instead, applications tend to want every
   triggering of the event to re-set the timeout.  So now, if you set
   up an event like this:
        struct event *ev;
        struct timeval tv;
        ev = event_new(base, fd, EV_READ|EV_PERSIST, cb, NULL);
        tv.tv_sec = 1;
        tv.tv_usec = 0;
        event_add(ev, &tv);

   The callback 'cb' will be invoked whenever fd is ready to read, OR whenever
   a second has passed since the last invocation of cb.

 2.7. Multiple events allowed per fd

   Older versions of Libevent allowed at most one EV_READ event and at most
   one EV_WRITE event per socket, per event base.  This restriction is no
   longer present.

 2.8. evthread_* functions for thread-safe structures.

   Libevent structures can now be built with locking support.  This code
   makes it safe to add, remove, and activate events on an event base from a
   different thread.  (Previously, if you wanted to write multithreaded code
   with Libevent, you could only an event_base or its events in one thread at
   a time.)

   If you want threading support and you're using pthreads, you can just
   call evthread_use_pthreads().  (You'll need to link against the
   libevent_pthreads library in addition to libevent_core.  These functions are
   not in libevent_core.)

   If you want threading support and you're using Windows, you can just
   call evthread_use_windows_threads().

   If you are using some locking system besides Windows and pthreads, You
   can enable this on a per-event-base level by writing functions to
   implement mutexes, conditions, and thread IDs, and passing them to
   evthread_set_lock_callbacks and related functions in event2/thread.h.

   Once locking functions are enabled, every new event_base is created with a
   lock.  You can prevent a single event_base from being built with a lock
   disabled by using the EVENT_BASE_FLAG_NOLOCK flag in its
   event_config.  If an event_base is created with a lock, it is safe to call
   event_del, event_add, and event_active on its events from any thread.  The
   event callbacks themselves are still all executed from the thread running
   the event loop.

   To make an evbuffer or a bufferevent object threadsafe, call its
   *_enable_locking() function.

   The HTTP api is not currently threadsafe.

   To build Libevent with threading support disabled, pass
   --disable-thread-support to the configure script.

 2.9. Edge-triggered events on some backends.

   With some backends, it's now possible to add the EV_ET flag to an event
   in order to request that the event's semantics be edge-triggered.  Right
   now, epoll and kqueue support this.

   The corresponding event_config feature is EV_FEATURE_ET; see 2.4 for more
   information.

 2.10. Better support for huge numbers of timeouts

   The heap-based priority queue timer implementation for Libevent 1.4 is good
   for randomly distributed timeouts, but suboptimal if you have huge numbers
   of timeouts that all expire in the same amount of time after their
   creation.  The new event_base_init_common_timeout() logic lets you signal
   that a given timeout interval will be very common, and should use a linked
   list implementation instead of a priority queue.

 2.11. Improved debugging support

   It's been pretty easy to forget to delete all your events before you
   re-initialize them, or otherwise put Libevent in an internally inconsistent
   state.  You can tell libevent to catch these and other common errors with
   the new event_enable_debug_mode() call.  Just invoke it before you do
   any calls to other libevent functions, and it'll catch many common
   event-level errors in your code.

 2.12. Functions to access all event fields

   So that you don't have to access the struct event fields directly, Libevent
   now provides accessor functions to retrieve everything from an event that
   you set during event_new() or event_assign().

 3. Backend-specific and performance improvements.

 3.1. Change-minimization on O(1) backends

   With previous versions of Libevent, if you called event_del() and
   event_add() repeatedly on a single event between trips to the backend's
   dispatch function, the backend might wind up making unnecessary calls or
   passing unnecessary data to the kernel.  The new backend logic batches up
   redundant adds and deletes, and performs no more operations than necessary
   at the kernel level.

   This logic is on for the kqueue backend, and available (but off by
   default) for the epoll backend.  To turn it on for the epoll backend,
   set the EVENT_BASE_FLAG_EPOLL_USE_CHANGELIST flag in the
   event_base_cofig, or set the EVENT_EPOLL_USE_CHANGELIST environment
   variable.  Doing this with epoll may result in weird bugs if you give
   any fds closed by dup() or its variants.

 3.2. Improved notification on Linux

   When we need to wake the event loop up from another thread, we use
   an epollfd to do so, instead of a socketpair.  This is supposed to be
   faster.

 3.3. Windows: better support for everything

   Bufferevents on Windows can use a new mechanism (off-by-default; see below)
   to send their data via Windows overlapped IO and get their notifications
   via the IOCP API.  This should be much faster than using event-based
   notification.

   Other functions throughout the code have been fixed to work more
   consistently with Windows.  Libevent now builds on Windows using either
   mingw, or using MSVC (with nmake).  Libevent works fine with UNICODE
   defined, or not.

   Data structures are a little smarter: our lookups from socket to pending
   event are now done with O(1) hash tables rather than O(lg n) red-black
   trees.

   Unfortunately, the main Windows backend is still select()-based: from
   testing the IOCP backends on the mailing list, it seems that there isn't
   actually a way to tell for certain whether a socket is writable with IOCP.
   Libevent 2.1 may add a multithreaded WaitForMultipleEvents-based
   backend for better performance with many inactive sockets and better
   integration with Windows events.

 4. Improvements to evbuffers

   Libevent has long had an "evbuffer" implementation to wrap access to an
   input or output memory buffer.  In previous versions, the implementation
   was very inefficient and lacked some desirable features.  We've made many
   improvements in Libevent 2.0.

 4.1. Chunked-memory internal representation

   Previously, each evbuffer was a huge chunk of memory.  When we ran out of
   space in an evbuffer, we used realloc() to grow the chunk of memory.  When
   data was misaligned, we used memmove to move the data back to the front
   of the buffer.

   Needless to say, this is a terrible interface for networked IO.

   Now, evbuffers are implemented as a linked list of memory chunks, like
   most Unix kernels use for network IO.  (See Linux's skbuf interfaces,
   or *BSD's mbufs).  Data is added at the end of the linked list and
   removed from the front, so that we don't ever need realloc huge chunks
   or memmove the whole buffer contents.

   To avoid excessive calls to read and write, we use the readv/writev
   interfaces (or WSASend/WSARecv on Windows) to do IO on multiple chunks at
   once with a single system call.

   COMPATIBILITY NOTE:
   The evbuffer struct is no longer exposed in a header.  The code here is
   too volatile to expose an official evbuffer structure, and there was never
   any means provided to create an evbuffer except via evbuffer_new which
   heap-allocated the buffer.

   If you need access to the whole buffer as a linear chunk of memory, the
   EVBUFFER_DATA() function still works.  Watch out, though: it needs to copy
   the buffer's contents in a linear chunk before you can use it.

 4.2. More flexible readline support

   The old evbuffer_readline() function (which accepted any sequence of
   CR and LF characters as a newline, and which couldn't handle lines
   containing NUL characters), is now deprecated.  The preferred
   function is evbuffer_readln(), which supports a variety of
   line-ending styles, and which can return the number of characters in
   the line returned.

   You can also call evbuffer_search_eol() to find the end of a line
   in an evbuffer without ever extracting the line.

 4.3. Support for file-based IO in evbuffers.

   You can now add chunks of a file into a evbuffer, and Libevent will have
   your OS use mapped-memory functionality, sendfile, or splice to transfer
   the data without ever copying it to userspace.  On OSs where this is not
   supported, Libevent just loads the data.

   There are probably some bugs remaining in this code.  On some platforms
   (like Windows), it just reads the relevant parts of the file into RAM.

 4.4. Support for zero-copy ("scatter/gather") writes in evbuffers.

   You can add a piece of memory to an evbuffer without copying it.
   Instead, Libevent adds a new element to the evbuffer's linked list of
   chunks with a pointer to the memory you supplied.  You can do this
   either with a reference-counted chunk (via evbuffer_add_reference), or
   by asking Libevent for a pointer to its internal vectors (via
   evbuffer_reserve_space or evbuffer_peek()).

 4.5. Multiple callbacks per evbuffer

   Previously, you could only have one callback active on an evbuffer at a
   time.  In practice, this meant that if one part of Libevent was using an
   evbuffer callback to notice when an internal evbuffer was reading or
   writing data, you couldn't have your own callback on that evbuffer.

   Now, you can now use the evbuffer_add_cb() function to add a callback that
   does not interfere with any other callbacks.

   The evbuffer_setcb() function is now deprecated.

 4.6. New callback interface

   Previously, evbuffer callbacks were invoked with the old size of the
   buffer and the new size of the buffer.  This interface could not capture
   operations that simultaneously filled _and_ drained a buffer, or handle
   cases where we needed to postpone callbacks until multiple operations were
   complete.

   Callbacks that are set with evbuffer_setcb still use the old API.
   Callbacks added with evbuffer_add_cb() use a new interface that takes a
   pointer to a struct holding the total number of bytes drained read and the
   total number of bytes written.  See event2/buffer.h for full details.

 4.7. Misc new evbuffer features

    You can use evbuffer_remove() to move a given number of bytes from one
    buffer to another.

    The evbuffer_search() function lets you search for repeated instances of
    a pattern inside an evbuffer.

    You can use evbuffer_freeze() to temporarily suspend drains from or adds
    to a given evbuffer.  This is useful for code that exposes an evbuffer as
    part of its public API, but wants users to treat it as a pure source or
    sink.

    There's an evbuffer_copyout() that looks at the data at the start of an
    evbuffer without doing a drain.

    You can have an evbuffer defer all of its callbacks, so that rather than
    being invoked immediately when the evbuffer's length changes, they are
    invoked from within the event_loop.  This is useful when you have a
    complex set of callbacks that can change the length of other evbuffers,
    and you want to avoid having them recurse and overflow your stack.

 5. Bufferevents improvements

    Libevent has long included a "bufferevents" structure and related
    functions that were useful for generic buffered IO on a TCP connection.
    This is what Libevent uses for its HTTP implementation.  In addition to
    the improvements that they get for free from the underlying evbuffer
    implementation above, there are many new features in Libevent 2.0's
    evbuffers.

 5.1. New OO implementations

    The "bufferevent" structure is now an abstract base type with multiple
    implementations.  This should not break existing code, which always
    allocated bufferevents with bufferevent_new().

    Current implementations of the bufferevent interface are described below.

 5.2. bufferevent_socket_new() replaces bufferevent_new()

    Since bufferevents that use a socket are not the only kind,
    bufferevent_new() is now deprecated.  Use bufferevent_socket_new()
    instead.

 5.3. Filtered bufferevent IO

    You can use bufferevent_filter_new() to create a bufferevent that wraps
    around another bufferevent and transforms data it is sending and
    receiving.  See test/regress_zlib.c for a toy example that uses zlib to
    compress data before sending it over a bufferevent.

 5.3. Linked pairs of bufferevents

    You can use bufferevent_pair_new() to produce two linked
    bufferevents.  This is like using socketpair, but doesn't require
    system-calls.

 5.4. SSL support for bufferevents with OpenSSL

    There is now a bufferevent type that supports SSL/TLS using the
    OpenSSL library.  The code for this is build in a separate
    library, libevent_openssl, so that your programs don't need to
    link against OpenSSL unless they actually want SSL support.

    There are two ways to construct one of these bufferevents, both
    declared in <event2/bufferevent_ssl.h>.  If you want to wrap an
    SSL layer around an existing bufferevent, you would call the
    bufferevent_openssl_filter_new() function.  If you want to do SSL
    on a socket directly, call bufferevent_openssl_socket_new().

 5.5. IOCP support for bufferevents on Windows

    There is now a bufferevents backend that supports IOCP on Windows.
    Supposedly, this will eventually make Windows IO much faster for
    programs using bufferevents.  We'll have to see; the code is not
    currently optimized at all.  To try it out, call the
    event_base_start_iocp() method on an event_base before contructing
    bufferevents.

    This is tricky code; there are probably some bugs hiding here.

 5.6. Improved connect support for bufferevents.

    You can now create a bufferevent that is not yet connected to any
    host, and tell it to connect, either by address or by hostname.

    The functions to do this are bufferevent_socket_connect and
    bufferevent_socket_connect_hostname.

 5.7. Rate-limiting for bufferevents

    If you need to limit the number of bytes read/written by a single
    bufferevent, or by a group of them, you can do this with a new set of
    bufferevent rate-limiting calls.

 6. Other improvements

 6.1. DNS improvements

 6.1.1. DNS: IPv6 nameservers

    The evdns code now lets you have nameservers whose addresses are IPv6.

 6.1.2. DNS: Better security

    Libevent 2.0 tries harder to resist DNS answer-sniping attacks than
    earlier versions of evdns.  See comments in the code for full details.

    Notably, evdns now supports the "0x20 hack" to make it harder to
    impersonate a DNS server.  Additionally, Libevent now uses a strong
    internal RNG to generate DNS transaction IDs, so you don't need to supply
    your own.

 6.1.3. DNS: Getaddrinfo support

    There's now an asynchronous getaddrinfo clone, evdns_getaddrinfo(),
    to make the results of the evdns functions more usable.  It doesn't
    support every feature of a typical platform getaddrinfo() yet, but it
    is quite close.

    There is also a blocking evutil_getaddrinfo() declared in
    event2/util.h, to provide a getaddrinfo() implementation for
    platforms that don't have one, and smooth over the differences in
    various platforms implementations of RFC3493.

    Bufferevents provide bufferevent_connect_hostname(), which combines
    the name lookup and connect operations.

 6.1.4. DNS: No more evdns globals

    Like an event base, evdns operations are now supposed to use an evdns_base
    argument.  This makes them easier to wrap for other (more OO) languages,
    and easier to control the lifetime of.  The old evdns functions will
    still, of course, continue working.

 6.2. Listener support

    You can now more easily automate setting up a bound socket to listen for
    TCP connections.  Just use the evconnlistener_*() functions in the
    event2/listener.h header.

    The listener code supports IOCP on Windows if available.

 6.3. Secure RNG support

    Network code very frequently needs a secure, hard-to-predict random number
    generator.  Some operating systems provide a good C implementation of one;
    others do not.  Libevent 2.0 now provides a consistent implementation
    based on the arc4random code originally from OpenBSD.  Libevent (and you)
    can use the evutil_secure_rng_*() functions to access a fairly secure
    random stream of bytes.

 6.4. HTTP

    The evhttp uriencoding and uridecoding APIs have updated versions
    that behave more correctly, and can handle strings with internal NULs.

    The evhttp query parsing and URI parsing logic can now detect errors
    more usefully.  Moreover, we include an actual URI parsing function
    (evhttp_uri_parse()) to correctly parse URIs, so as to discourage
    people from rolling their own ad-hoc parsing functions.

    There are now accessor functions for the useful fields of struct http
    and friends; it shouldn't be necessary to access them directly any
    more.

    Libevent now lets you declare support for all specified HTTP methods,
    including OPTIONS, PATCH, and so on.  The default list is unchanged.

    Numerous evhttp bugs also got fixed.

 7. Infrastructure improvements

 7.1. Better unit test framework

    We now use a unit test framework that Nick wrote called "tinytest".
    The main benefit from Libevent's point of view is that tests which
    might mess with global state can all run each in their own
    subprocess.  This way, when there's a bug that makes one unit test
    crash or mess up global state, it doesn't affect any others.

 7.2. Better unit tests

    Despite all the code we've added, our unit tests are much better than
    before.  Right now, iterating over the different backends on various
    platforms, I'm getting between 78% and 81% test coverage, compared
    with less than 45% test coverage in Libevent 1.4.