This is a virtio protocol definition, companion Linux guest kernel driver, and set of host-side devices for virtualizing media devices using virtio, following the same model (and structures) as V4L2. It can be used to virtualize cameras, codec devices, or any other device supported by V4L2.
Want to try it? See the TRY_IT_OUT document.
V4L2 is a UAPI that allows a less privileged entity (user-space) to use video hardware exposed by a more privileged entity (the kernel). Virtio-media is an encapsulation of this API into virtio, turning it into a virtualization API for all classes of video devices supported by V4L2, where the host plays the role of the kernel and the guest the role of user-space.
The host is therefore responsible for presenting a virtual device that behaves like an actual V4L2 device, which the guest can control.
This repository includes a simple guest Linux kernel module supporting this protocol. On the host side, devices can be implemented in several ways:
Note that virtio-media does not require the use of a V4L2 device driver or of Linux on the host or guest side - V4L2 is only used as a host-guest protocol, and both sides are free to convert it from/to any model that they wish to use.
The complete definition of V4L2 structures and ioctls can be found under the V4L2 UAPI documentation, which should be referred to alongside this document.
The driver (in the driver/
directory) should be working and supporting most V4L2 features, with the exception of the following:
VIDIOC_OVERLAY
VIDIOC_G/S_FBUF
DMABUF
buffers (this will be supported at least for virtio objects, other kinds of DMABUFs may or may not be usable)VIDIOC_EXPBUF
(to be implemented)VIDIOC_G/S_EDID
(to be implemented if it makes sense in a virtual context)The devices/
directory contains a Rust crate implementing helper functions to parse the protocol, interfaces to easily implement devices, and a couple of device implementations. It is written to be easily pluggable on any VMM. See the rustdoc in the devices/
directory for more information.
Implemented devices are:
simple_device.rs
), purely software-based and thus not requiring any kind of hardware. This is here for reference and testing purposes.v4l2_device_proxy.rs
).extras/ffmpeg-decoder
.Virtio-media uses device ID 48
.
There are two queues in use:
0 : commandq - queue for driver commands and device responses to these commands. The device MUST return the descriptor chains it receives as soon as possible, and must never hold to them for indefinite periods of time.
1 : eventq - queue for events sent by the device to the driver. The driver MUST re-queue the descriptor chains returned by the device as soon as possible, and must never hold on them for indefinite periods of time.
The configuration area contains the following information:
struct virtio_v4l2_config { /// The device_caps field of struct video_device. u32 device_caps; /// The vfl_devnode_type of the device. u32 device_type; /// The `card` field of v4l2_capability. u8 card[32]; }
Shared memory region 0
is used to map MMAP
buffers into the guest using the VIRTIO_MEDIA_CMD_MMAP
command. If the host does not provide it, then MMAP
buffers cannot be mapped into the guest.
All structures managing the virtio protocol are defined and documented in protocol.h
. Please refer to this file whenever a virtio_media_cmd_*
or virtio_media_resp_*
structure is mentioned.
Commands are queued on the commandq
by the driver for the device to process. They all start by an instance of struct virtio_media_cmd_header
and include device-writable descriptors for the device to write the result of the command in a struct virtio_media_resp_header
.
The errors returned by each command are standard Linux kernel error codes. For instance, a command that contains invalid options will return EINVAL
.
Events are sent on the eventq
by the device for the driver to handle. They all start by an instance of struct virtio_media_event_header
.
In order to use the device, the driver needs to open a session. This act is equivalent to opening the /dev/videoX
device file of the V4L2 device. Depending on the type of device, it may be possible to open several sessions concurrently.
A session is opened by queueing a struct virtio_media_cmd_open
along with a descriptor to receive a struct virtio_media_resp_open
to the commandq. An open session can be closed with struct virtio_media_cmd_close
.
While the session is opened, its ID can be used to perform actions on it, most commonly V4L2 ioctls.
Ioctls are the main way to interact with V4L2 devices, and therefore virtio-media features a command to perform an ioctl on an open session.
In order to perform an ioctl, the driver queues a struct virtio_media_cmd_ioctl
along with a descriptor to receive a struct virtio_media_resp_ioctl
on the commandq. The code of the ioctl can be extracted from the videodev2.h header file, which defines the ioctls' codes, type of payload, and direction. For instance, the VIDIOC_G_FMT
ioctl is defined as follows:
#define VIDIOC_G_FMT _IOWR('V', 4, struct v4l2_format)
This tells us that its ioctl code is 4
, that its payload is a struct v4l2_format
, and that its direction is WR
, i.e. the payload is written by both the driver and the device.
The payload layout is always a 64-bit representation of the corresponding V4L2 structure, irrespective of the host and guest architecture.
The payload of an ioctl in the descriptor chain follows the command structure, the reponse structure, or both depending on the direction:
_IOR
ioctl is read-only for the driver, meaning the payload follows the response in the device-writable section of the descriptor chain._IOW
ioctl is read-only for the device, meaning the payload follows the command in the driver-writable section of the descriptor chain._IORW
ioctl is writable by both the device and driver, meaning the payload must follow both the command in the driver-writable section of the descriptor chain, and the response in the device-writable section.For instance, the VIDIOC_G_STD
ioctl is defined as follows:
#define VIDIOC_G_STD _IOR('V', 23, v4l2_std_id)
Thus, its layout on the commandq will be:
+-------------------------------------+ | struct virtio_media_cmd_ioctl | +=====================================+ | struct virtio_media_resp_ioctl | +-------------------------------------+ | v4l2_std_id | +-------------------------------------+
(in these diagrams, the ====
line signals the delimitation between device-readable and device-writable descriptors).
VIDIOC_SUBSCRIBE_EVENT
is defined as follows:
#define VIDIOC_SUBSCRIBE_EVENT _IOW('V', 90, struct v4l2_event_subscription)
Meaning its layout on the commandq will be:
+-------------------------------------+ | struct virtio_media_cmd_ioctl | +-------------------------------------+ | struct v4l2_event_subscription | +=====================================+ | struct virtio_media_resp_ioctl | +-------------------------------------+
Finally, VIDIOC_G_FMT
is a WR
ioctl:
#define VIDIOC_G_FMT _IOWR('V', 4, struct v4l2_format)
Its layout on the commandq will thus be:
+-------------------------------------+ | struct virtio_media_cmd_ioctl | +-------------------------------------+ | struct v4l2_format | +=====================================+ | struct virtio_media_resp_ioctl | +-------------------------------------+ | struct v4l2_format | +-------------------------------------+
A common optimization for WR
ioctls is to provide the payload using descriptors that both point to the same buffer. This mimics the behavior of V4L2 ioctls where the data is only passed once and used as both input and output by the kernel.
In case of success, the device MUST always write the payload in the device-writable part of the descriptor chain.
In case of failure, the device is free to write the payload in the device-writable part of the descriptor chain or not. Some errors may still result in the payload being updated, and in this case the device is expected to write the updated payload (for instance, G_EXT_CTRLS
may return EINVAL
but update the size
member of the requested controls if the provided size was not enough). If the device has not written the payload after an error, the driver MUST assume that the payload has not been modified.
A few structures used as ioctl payloads contain pointers the link to further data needed for the ioctl. There are notably:
planes
pointer of struct v4l2_buffer
, which size is determined by the length
member,controls
pointer of struct v4l2_ext_controls
, which size is determined by the count
member.If the size of the pointed area is determined to be non-zero, then the main payload is immediately followed by the pointed data in their order of appearance in the structure, and the pointer value itself is ignored by the device, which must also return the value initially passed by the driver. For instance, for a struct v4l2_ext_controls
which count
is 16
:
+--------------------------------------+ | struct v4l2_ext_controls | +--------------------------------------+ | struct v4l2_ext_control for plane 0 | | struct v4l2_ext_control for plane 1 | | ... | | struct v4l2_ext_control for plane 15 | +--------------------------------------+
Similarly, a multiplanar struct v4l2_buffer
with its length
member set to 3 will be laid out as follows:
+-------------------------------------+ | struct v4l2_buffer | +-------------------------------------+ | struct v4l2_plane for plane 0 | | struct v4l2_plane for plane 1 | | struct v4l2_plane for plane 2 | +-------------------------------------+
A few pointers are special in that they point to userspace memory. They are:
m.userptr
member of struct v4l2_buffer
and struct v4l2_plane
(technically an unsigned long
, but designated a userspace address),ptr
member of struct v4l2_ext_ctrl
.These pointers can cover large areas of scattered memory, which has the potential to require more descriptors than the virtio queue can provide. For these particular pointers only, a list of struct virtio_media_sg_entry
that covers the needed amount of memory for the pointer is used instead of using descriptors to map the pointed memory directly.
For each such pointer to read, the device reads as many SG entries as needed to cover the length of the pointed buffer, as described by its parent structure (length
member of struct v4l2_buffer
or struct v4l2_plane
for buffer memory, and size
member of struct v4l2_ext_control
for control data).
Since the device never needs to modify the list of SG entries, it is only provided by the driver in the device-readable section of the descriptor chain, and not repeated in the device-writable section, even for WR
ioctls.
To illustrate the data layout, here is what the descriptor chain of a VIDIOC_QBUF
ioctl queueing a 3-planar USERPTR
buffer would look like:
+---------------------------------------------------+ | struct virtio_media_cmd_ioctl | +---------------------------------------------------+ | struct v4l2_buffer | +---------------------------------------------------+ | struct v4l2_plane for plane 0 | | struct v4l2_plane for plane 1 | | struct v4l2_plane for plane 2 | +---------------------------------------------------+ | array of struct virtio_media_sg_entry for plane 0 | +---------------------------------------------------+ | array of struct virtio_media_sg_entry for plane 1 | +---------------------------------------------------+ | array of struct virtio_media_sg_entry for plane 2 | +===================================================+ | struct virtio_media_resp_ioctl | +---------------------------------------------------+ | struct v4l2_buffer | +---------------------------------------------------+ | struct v4l2_plane for plane 0 | | struct v4l2_plane for plane 1 | | struct v4l2_plane for plane 2 | +---------------------------------------------------+
Since the ioctl is RW
, the payload is repeated in both the device-readable and device-writable sections of the descriptor chain, but the device-writable section does not include the SG lists to guest memory.
A few ioctls are replaced by other, more suitable mechanisms. If being requested these ioctls, the device must return the same response as it would for an unknown ioctl, i.e. ENOTTY
.
VIDIOC_QUERYCAP
is replaced by reading the configuration area.VIDIOC_DQBUF
is replaced by a dedicated event.VIDIOC_DQEVENT
is replaced by a dedicated event.VIDIOC_G_JPEGCOMP
and VIDIOC_S_JPEGCOMP
are deprecated and replaced by the controls of the JPEG class.VIDIOC_LOG_STATUS
is a guest-only operation and shall not be implemented by the host.Events are a way for the device to inform the driver about asynchronous events that it should know about. In virtio-media, they are used as a replacement for the VIDIOC_DQBUF
and VIDIOC_DQEVENT
ioctls and the polling mechanism, which would be impractical to implement on top of virtio.
A struct virtio_media_event_dqbuf
event is queued on the eventq by the device every time a buffer previously queued using the VIDIOC_QBUF
ioctl is done being processed and can be used by the driver again. This is like an implicit VIDIOC_DQBUF
ioctl.
Pointer values in the struct v4l2_buffer
and struct v4l2_plane
are meaningless and must be ignored by the driver. It is recommended that the device sets them to NULL
in order to avoid leaking potential host addresses.
Note that in the case of a USERPTR
buffer, the struct v4l2_buffer
used as event payload is not followed by the buffer memory: since that memory is the same that the driver submitted with the VIDIOC_QBUF
, it would be redundant to have it here.
A struct virtio_media_event_event
event is queued on the eventq by the device every time an event the driver previously subscribed to using the VIDIOC_SUBSCRIBE_EVENT
ioctl has been signaled. This is like an implicit VIDIOC_DQEVENT
ioctl.
The semantics of the three V4L2 memory types (MMAP
, USERPTR
and DMABUF
) can easily be mapped to a guest/host context.
In virtio-media, MMAP
buffers are provisioned by the host, just like they are by the kernel in regular V4L2. Similarly to how userspace can map a MMAP
buffer into its address space using mmap
and munmap
, the virtio-media driver can map host buffers into the guest space by queueing the struct virtio_media_cmd_mmap
and struct virtio_media_cmd_munmap
commands to the commandq.
In virtio-media, USERPTR
buffers and provisioned by the guest, just like they are by userspace in regular V4L2. Instances of struct v4l2_buffer
and struct v4l2_plane
of this type are followed by a series of descriptors mapping the buffer backing memory in guest space.
For the host convenience, the backing memory must start with a new descriptor - this allows the host to easily map the buffer memory to render into it instead of having to do a copy.
The host must not alter the pointer values provided by the guest, i.e. the m.userptr
member of struct v4l2_buffer
and struct v4l2_plane
must be returned to the guest with the same value as it was provided.
In virtio-media, DMABUF
buffers are provisioned by a virtio object, just like they are by a DMABUF in regular V4L2. Virtio objects are 16-bytes UUIDs and do not fit in the placeholders for file descriptors, so they follow their embedding data structure as needed and the device must leave the V4L2 structure placeholder unchanged. For instance, a 3-planar struct v4l2_buffer
with the V4L2_MEMORY_DMABUF
memory type will have the following layout:
+-------------------------------------+ | struct v4l2_buffer | +-------------------------------------+ | struct v4l2_plane for plane 0 | | struct v4l2_plane for plane 1 | | struct v4l2_plane for plane 2 | +-------------------------------------+ | 16 byte UUID for plane 0 | +-------------------------------------+ | 16 byte UUID for plane 1 | +-------------------------------------+ | 16 byte UUID for plane 2 | +-------------------------------------+
Contrary to USERPTR
buffers, virtio objects UUIDs need to be added in both the device-readable and device-writable section of the descriptor chain.
Host-allocated buffers with the V4L2_MEMORY_MMAP
memory type can also be exported as virtio objects for use with another virtio device using the VIDIOC_EXPBUF
ioctl. The fd
placefolder of v4l2_exportbuffer
means that space for the UUID needs to be reserved right after that structure, so the ioctl layout will looks as follows:
+-------------------------------------+ | struct virtio_media_cmd_ioctl | +-------------------------------------+ | struct v4l2_exportbuffer | +-------------------------------------+ | 16 bytes UUID for exported buffer | +=====================================+ | struct virtio_media_resp_ioctl | +-------------------------------------+ | struct v4l2_exportbuffer | +-------------------------------------+ | 16 bytes UUID for exported buffer | +-------------------------------------+