ad7877: keep dma rx buffers in seperate cache lines [Kernel]

Prev: HELLO
Next: mxc: gadget: remove 60mhz clock requirement for freescale mx51 usb core

From: Mike Frysinger on 6 May 2010 14:50

On Thu, May 6, 2010 at 06:37, Oskar Schirmer wrote:
> Â struct ser_req {
> + Â Â Â u16 Â Â Â Â Â Â Â Â Â Â sample;
> + Â Â Â char Â Â Â Â Â Â Â Â Â Â __padalign[L1_CACHE_BYTES - sizeof(u16)];
> +
> Â Â Â Â u16 Â Â Â Â Â Â Â Â Â Â reset;
> Â Â Â Â u16 Â Â Â Â Â Â Â Â Â Â ref_on;
> Â Â Â Â u16 Â Â Â Â Â Â Â Â Â Â command;
> - Â Â Â u16 Â Â Â Â Â Â Â Â Â Â sample;
> Â Â Â Â struct spi_message Â Â Â msg;
> Â Â Â Â struct spi_transfer Â Â xfer[6];
> Â };

are you sure this is necessary ? ser_req is only ever used with
spi_sync() and it's allocated/released on the fly, so how could
anything be reading that memory between the start of the transmission
and the return to adi7877 ?

> Â struct ad7877 {
> + Â Â Â u16 Â Â Â Â Â Â Â Â Â Â conversion_data[AD7877_NR_SENSE];
> + Â Â Â char Â Â Â Â Â Â Â Â Â Â __padalign[L1_CACHE_BYTES
> + Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â - AD7877_NR_SENSE * sizeof(u16)];
> +
> Â Â Â Â struct input_dev Â Â Â Â *input;
> Â Â Â Â char Â Â Â Â Â Â Â Â Â Â phys[32];
>
> @@ -182,8 +188,6 @@ struct ad7877 {
> Â Â Â Â u8 Â Â Â Â Â Â Â Â Â Â Â averaging;
> Â Â Â Â u8 Â Â Â Â Â Â Â Â Â Â Â pen_down_acc_interval;
>
> - Â Â Â u16 Â Â Â Â Â Â Â Â Â Â conversion_data[AD7877_NR_SENSE];
> -
> Â Â Â Â struct spi_transfer Â Â xfer[AD7877_NR_SENSE + 2];
> Â Â Â Â struct spi_message Â Â Â msg;

i can see the spi_message inside of this struct being a problem
because the spi transfer is doing asynchronously with spi_async().
however, i would add a comment right above these two fields with a
short explanation as to why they're at the start and why the pad
exists so someone down the line doesnt move it.
-mike

From: Oskar Schirmer on 7 May 2010 06:20

On Thu, May 06, 2010 at 14:46:04 -0400, Mike Frysinger wrote:
> On Thu, May 6, 2010 at 06:37, Oskar Schirmer wrote:
> > struct ser_req {
> > + u16 sample;
> > + char __padalign[L1_CACHE_BYTES - sizeof(u16)];
> > +
> > u16 reset;
> > u16 ref_on;
> > u16 command;
> > - u16 sample;
> > struct spi_message msg;
> > struct spi_transfer xfer[6];
> > };
>
> are you sure this is necessary ? ser_req is only ever used with
> spi_sync() and it's allocated/released on the fly, so how could
> anything be reading that memory between the start of the transmission
> and the return to adi7877 ?

msg is handed over to spi_sync, it contains the addresses
which will be used to programme the DMA: the spi master
transfer function will read these fields to start DMA.

E.g., drivers/spi/atmel_spi.c, assures cache coherency
in function atmel_spi_dma_map_xfer, one xfer at a time.
Multiple transfers are worked on in a loop in atmel_spi_transfer,
so when coherency for transfer #1 is ensured, addresses
for transfer #2 are read from the msg and xfer structs,
caching lines which have been just before invalidated.

And, citing Documentation/DMA-API.txt, Part Id:
"mapped region must begin exactly on a cache line
boundary and end exactly on one", which is our case.

>
> > struct ad7877 {
> > + u16 conversion_data[AD7877_NR_SENSE];
> > + char __padalign[L1_CACHE_BYTES
> > + - AD7877_NR_SENSE * sizeof(u16)];
> > +
> > struct input_dev *input;
> > char phys[32];
> >
> > @@ -182,8 +188,6 @@ struct ad7877 {
> > u8 averaging;
> > u8 pen_down_acc_interval;
> >
> > - u16 conversion_data[AD7877_NR_SENSE];
> > -
> > struct spi_transfer xfer[AD7877_NR_SENSE + 2];
> > struct spi_message msg;
>
> i can see the spi_message inside of this struct being a problem
> because the spi transfer is doing asynchronously with spi_async().
> however, i would add a comment right above these two fields with a
> short explanation as to why they're at the start and why the pad
> exists so someone down the line doesnt move it.

The code says "pad to align according to L1 cache, and
keep away other stuff by exactly the amount so it is
off the line". I'ld guess a comment would repeat just
this, so it is superfluous. But if opinions differ on
this topic, we can have a comment added, sure.

Oskar
--
oskar schirmer, emlix gmbh, http://www.emlix.com
fon +49 551 30664-0, fax -11, bahnhofsallee 1b, 37081 göttingen, germany
sitz der gesellschaft: göttingen, amtsgericht göttingen hr b 3160
geschäftsführer: dr. uwe kracke, ust-idnr.: de 205 198 055

emlix - your embedded linux partner
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Johannes Weiner on 7 May 2010 08:10

On Thu, May 06, 2010 at 02:46:04PM -0400, Mike Frysinger wrote:
> On Thu, May 6, 2010 at 06:37, Oskar Schirmer wrote:
> > �struct ser_req {
> > + � � � u16 � � � � � � � � � � sample;
> > + � � � char � � � � � � � � � �__padalign[L1_CACHE_BYTES - sizeof(u16)];
> > +
> > � � � �u16 � � � � � � � � � � reset;
> > � � � �u16 � � � � � � � � � � ref_on;
> > � � � �u16 � � � � � � � � � � command;
> > - � � � u16 � � � � � � � � � � sample;
> > � � � �struct spi_message � � �msg;
> > � � � �struct spi_transfer � � xfer[6];
> > �};
>
> are you sure this is necessary ? ser_req is only ever used with
> spi_sync() and it's allocated/released on the fly, so how could
> anything be reading that memory between the start of the transmission
> and the return to adi7877 ?

The master driver can. atmel_spi flushes the cache of the buffers
pretty early when queuing a message and touches the message members
afterwards.

> > �struct ad7877 {
> > + � � � u16 � � � � � � � � � � conversion_data[AD7877_NR_SENSE];
> > + � � � char � � � � � � � � � �__padalign[L1_CACHE_BYTES
> > + � � � � � � � � � � � � � � � � � � � - AD7877_NR_SENSE * sizeof(u16)];
> > +
> > � � � �struct input_dev � � � �*input;
> > � � � �char � � � � � � � � � �phys[32];
> >
> > @@ -182,8 +188,6 @@ struct ad7877 {
> > � � � �u8 � � � � � � � � � � �averaging;
> > � � � �u8 � � � � � � � � � � �pen_down_acc_interval;
> >
> > - � � � u16 � � � � � � � � � � conversion_data[AD7877_NR_SENSE];
> > -
> > � � � �struct spi_transfer � � xfer[AD7877_NR_SENSE + 2];
> > � � � �struct spi_message � � �msg;
>
> i can see the spi_message inside of this struct being a problem
> because the spi transfer is doing asynchronously with spi_async().
> however, i would add a comment right above these two fields with a
> short explanation as to why they're at the start and why the pad
> exists so someone down the line doesnt move it.

Good idea.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Mike Frysinger on 7 May 2010 14:30

On Fri, May 7, 2010 at 06:15, Oskar Schirmer wrote:
> On Thu, May 06, 2010 at 14:46:04 -0400, Mike Frysinger wrote:
>> On Thu, May 6, 2010 at 06:37, Oskar Schirmer wrote:
>> > struct ser_req {
>> > + u16 sample;
>> > + char __padalign[L1_CACHE_BYTES - sizeof(u16)];
>> > +
>> > u16 reset;
>> > u16 ref_on;
>> > u16 command;
>> > - u16 sample;
>> > struct spi_message msg;
>> > struct spi_transfer xfer[6];
>> > };
>>
>> are you sure this is necessary ? ser_req is only ever used with
>> spi_sync() and it's allocated/released on the fly, so how could
>> anything be reading that memory between the start of the transmission
>> and the return to adi7877 ?
>
> msg is handed over to spi_sync, it contains the addresses
> which will be used to programme the DMA: the spi master
> transfer function will read these fields to start DMA.

so the issue is coming from the SPI master drivers and not the AD7877 driver

>> > struct ad7877 {
>> > + u16 conversion_data[AD7877_NR_SENSE];
>> > + char __padalign[L1_CACHE_BYTES
>> > + - AD7877_NR_SENSE * sizeof(u16)];
>> > +
>> > struct input_dev *input;
>> > char phys[32];
>> >
>> > @@ -182,8 +188,6 @@ struct ad7877 {
>> > u8 averaging;
>> > u8 pen_down_acc_interval;
>> >
>> > - u16 conversion_data[AD7877_NR_SENSE];
>> > -
>> > struct spi_transfer xfer[AD7877_NR_SENSE + 2];
>> > struct spi_message msg;
>>
>> i can see the spi_message inside of this struct being a problem
>> because the spi transfer is doing asynchronously with spi_async().
>> however, i would add a comment right above these two fields with a
>> short explanation as to why they're at the start and why the pad
>> exists so someone down the line doesnt move it.
>
> The code says "pad to align according to L1 cache, and
> keep away other stuff by exactly the amount so it is
> off the line". I'ld guess a comment would repeat just
> this, so it is superfluous. But if opinions differ on
> this topic, we can have a comment added, sure.

not everyone knows to read every single piece of documentation that
may or may not be affected implicitly in the call stack. a simple
comment here is not superfluous.

since the other struct is also going to be changed, a comment should
be placed there as well.
-mike
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

From: Johannes Weiner on 8 May 2010 18:40

On Fri, May 07, 2010 at 02:28:16PM -0400, Mike Frysinger wrote:
> On Fri, May 7, 2010 at 06:15, Oskar Schirmer wrote:
> > On Thu, May 06, 2010 at 14:46:04 -0400, Mike Frysinger wrote:
> >> On Thu, May 6, 2010 at 06:37, Oskar Schirmer wrote:
> >> > �struct ser_req {
> >> > + � � � u16 � � � � � � � � � � sample;
> >> > + � � � char � � � � � � � � � �__padalign[L1_CACHE_BYTES - sizeof(u16)];
> >> > +
> >> > � � � �u16 � � � � � � � � � � reset;
> >> > � � � �u16 � � � � � � � � � � ref_on;
> >> > � � � �u16 � � � � � � � � � � command;
> >> > - � � � u16 � � � � � � � � � � sample;
> >> > � � � �struct spi_message � � �msg;
> >> > � � � �struct spi_transfer � � xfer[6];
> >> > �};
> >>
> >> are you sure this is necessary ? �ser_req is only ever used with
> >> spi_sync() and it's allocated/released on the fly, so how could
> >> anything be reading that memory between the start of the transmission
> >> and the return to adi7877 ?
> >
> > msg is handed over to spi_sync, it contains the addresses
> > which will be used to programme the DMA: the spi master
> > transfer function will read these fields to start DMA.
>
> so the issue is coming from the SPI master drivers and not the AD7877 driver

No, the issue is coming from ad7877 placing a transmission buffer
into the same cache line with memory locations that are accessed outside
the driver's scope. It must assume that the SPI driver does DMA, that
cache coherency is maintained at cache line granularity, and must not
make any assumptions about how the struct spi_message members are used.

The following is about slave drivers from Documentation/spi/spi-summary:

- Follow standard kernel rules, and provide DMA-safe buffers in
your messages. That way controller drivers using DMA aren't forced
to make extra copies unless the hardware requires it (e.g. working
around hardware errata that force the use of bounce buffering).

> >> > �struct ad7877 {
> >> > + � � � u16 � � � � � � � � � � conversion_data[AD7877_NR_SENSE];
> >> > + � � � char � � � � � � � � � �__padalign[L1_CACHE_BYTES
> >> > + � � � � � � � � � � � � � � � � � � � - AD7877_NR_SENSE * sizeof(u16)];
> >> > +
> >> > � � � �struct input_dev � � � �*input;
> >> > � � � �char � � � � � � � � � �phys[32];
> >> >
> >> > @@ -182,8 +188,6 @@ struct ad7877 {
> >> > � � � �u8 � � � � � � � � � � �averaging;
> >> > � � � �u8 � � � � � � � � � � �pen_down_acc_interval;
> >> >
> >> > - � � � u16 � � � � � � � � � � conversion_data[AD7877_NR_SENSE];
> >> > -
> >> > � � � �struct spi_transfer � � xfer[AD7877_NR_SENSE + 2];
> >> > � � � �struct spi_message � � �msg;
> >>
> >> i can see the spi_message inside of this struct being a problem
> >> because the spi transfer is doing asynchronously with spi_async().
> >> however, i would add a comment right above these two fields with a
> >> short explanation as to why they're at the start and why the pad
> >> exists so someone down the line doesnt move it.
> >
> > The code says "pad to align according to L1 cache, and
> > keep away other stuff by exactly the amount so it is
> > off the line". I'ld guess a comment would repeat just
> > this, so it is superfluous. But if opinions differ on
> > this topic, we can have a comment added, sure.
>
> not everyone knows to read every single piece of documentation that
> may or may not be affected implicitly in the call stack. a simple
> comment here is not superfluous.
>
> since the other struct is also going to be changed, a comment should
> be placed there as well.

How about

/*
* DMA (thus cache coherency maintainance) requires the
* transfer buffers to live in their own cache lines.
*/
char __padalign[...];

? It might be obvious what the code does, but I agree with
Mike that it might not be immediately apparent why it's needed.

Hannes
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

| Next | Last
Pages: 1 2 3 4 5 6 7 8
Prev: HELLO
Next: mxc: gadget: remove 60mhz clock requirement for freescale mx51 usb core