Re: [RFC PATCH 01/11] rust: add abstraction for struct device
From: Danilo Krummrich
Date: Mon May 20 2024 - 16:22:50 EST
On Mon, May 20, 2024 at 08:00:23PM +0200, Greg KH wrote:
> On Mon, May 20, 2024 at 07:25:38PM +0200, Danilo Krummrich wrote:
> > Add an (always) reference counted abstraction for a generic struct
> > device. This abstraction encapsulates existing struct device instances
> > and manages its reference count.
> >
> > Subsystems may use this abstraction as a base to abstract subsystem
> > specific device instances based on a generic struct device.
> >
> > Co-developed-by: Wedson Almeida Filho <wedsonaf@xxxxxxxxx>
> > Signed-off-by: Wedson Almeida Filho <wedsonaf@xxxxxxxxx>
> > Signed-off-by: Danilo Krummrich <dakr@xxxxxxxxxx>
> > ---
> > rust/helpers.c | 1 +
> > rust/kernel/device.rs | 76 +++++++++++++++++++++++++++++++++++++++++++
>
> What's the status of moving .rs files next to their respective .c files
> in the build system? Keeping them separate like this just isn't going
> to work, sorry.
>
> > --- /dev/null
> > +++ b/rust/kernel/device.rs
> > @@ -0,0 +1,76 @@
> > +// SPDX-License-Identifier: GPL-2.0
> > +
> > +//! Generic devices that are part of the kernel's driver model.
> > +//!
> > +//! C header: [`include/linux/device.h`](../../../../include/linux/device.h)
>
> relative paths for a common "include <linux/device.h" type of thing?
> Rust can't handle include paths from directories?
Going to change this to `srctree/` as proposed by Miguel.
>
> > +
> > +use crate::{
> > + bindings,
> > + types::{ARef, Opaque},
> > +};
> > +use core::ptr;
> > +
> > +/// A ref-counted device.
> > +///
> > +/// # Invariants
> > +///
> > +/// The pointer stored in `Self` is non-null and valid for the lifetime of the ARef instance. In
> > +/// particular, the ARef instance owns an increment on underlying object’s reference count.
> > +#[repr(transparent)]
> > +pub struct Device(Opaque<bindings::device>);
> > +
> > +impl Device {
> > + /// Creates a new ref-counted instance of an existing device pointer.
> > + ///
> > + /// # Safety
> > + ///
> > + /// Callers must ensure that `ptr` is valid, non-null, and has a non-zero reference count.
>
> Callers NEVER care about the reference count of a struct device, anyone
> poking in that is asking for trouble.
That's confusing, if not the caller who's passing the device pointer somewhere,
who else?
Who takes care that a device' reference count is non-zero when a driver's probe
function is called?
It's the same here. The PCI code calls Device::from_raw() from its
probe_callback() function, which is called from the C side. For instance:
extern "C" fn probe_callback(
pdev: *mut bindings::pci_dev,
id: *const bindings::pci_device_id,
) -> core::ffi::c_int {
// SAFETY: This is safe, since the C side guarantees that pdev is a valid,
// non-null pointer to a struct pci_dev with a non-zero reference count.
let dev = unsafe { device::Device::from_raw(&mut (*pdev).dev) };
[...]
}
>
> And why non-NULL? Can't you check for that here? Shouldn't you check
> for that here? Many driver core functions can handle a NULL pointer
> just fine (i.e. get/put_device() can), why should Rust code assume that
> a pointer passed to it from the C layer is going to have stricter rules
> than the C layer can provide?
We could check for NULL here, but I think it'd be pointless. Even if the pointer
is not NULL, it can still be an invalid one. There is no check we can do to
guarantee safety, hence the function is and remains unsafe and has safety
requirements instead that the caller must guarantee to fulfil.
Like in the example above, probe_callback() can give those guarantees instead.
>
> > + pub unsafe fn from_raw(ptr: *mut bindings::device) -> ARef<Self> {
> > + // SAFETY: By the safety requirements, ptr is valid.
> > + // Initially increase the reference count by one to compensate for the final decrement once
> > + // this newly created `ARef<Device>` instance is dropped.
> > + unsafe { bindings::get_device(ptr) };
> > +
> > + // CAST: `Self` is a `repr(transparent)` wrapper around `bindings::device`.
> > + let ptr = ptr.cast::<Self>();
> > +
> > + // SAFETY: By the safety requirements, ptr is valid.
> > + unsafe { ARef::from_raw(ptr::NonNull::new_unchecked(ptr)) }
> > + }
> > +
> > + /// Obtain the raw `struct device *`.
> > + pub(crate) fn as_raw(&self) -> *mut bindings::device {
> > + self.0.get()
> > + }
> > +
> > + /// Convert a raw `struct device` pointer to a `&Device`.
> > + ///
> > + /// # Safety
> > + ///
> > + /// Callers must ensure that `ptr` is valid, non-null, and has a non-zero reference count for
> > + /// the entire duration when the returned reference exists.
>
This is the doc comment of pub unsafe fn as_ref<'a>(ptr: *mut bindings::device)
-> &'a Self. Let's keep this context, it's confusing for other readers
otherwise.
> Again, non-NULL might not be true, and reference counts are never
Like above, it's just the safety precondition. Checking for NULL does not
improve the situation, we still need to rely on the pointer being a valid one.
> tracked by any user EXCEPT to increment/decrement it, you never know if
That's the whole point, if one takes an increment on the reference count they
can guarantee it's non-zero.
> it is 0 or not, all you know is that if a pointer is given to you by the
> driver core to a 'struct device' for a function that it is a valid
> reference at that point in time, or maybe NULL, until your function
> returns. Anything after that can not be counted on.
That's not contradictive to those safety comments. When Device::from_raw()
returns it has increased the reference count of the device by one. And when
Device::as_ref() returns the returned reference' lifetime is bound to the
lifetime of the pointer passed to it.
>
> thanks,
>
> greg k-h
>