From: matt.reilly on
I'll try to catch up on a few of the questions and conjectures:

1. The SiCortex founders (Jud Leonard, John Mucci, and I) had worked
before at
Digital (all three), Thinking Machines (John), Symbolics (Jud) and
other places.
We've since added folks from a lot of different places. (But I can't
think of any
Apollo alums, though I don't have the whole list of folks at my
fingertips.)

2. The DMA engine microcode is published. The brave and willing can
certainly
use it as a starting place for new approaches.

3. The systems are designed with a focus on reliability, low power,
deployability,
and good price performance for applications that scale to hundreds or
thousands
of processors. (We also fit a few other profiles that go beyond this
focus, but this
is what we were thinking when we designed and built the product.)

4. We implement MPI because that is what the major part of our target
markets
requires. We do pretty well at it too. The MPI implementation (based
on MPICH)
talks to the DMA engine through a work-queue based interface. The DMA
engine
microcode also supports IP over the SiCortex fabric and a high
performance
communication path to support the Lustre parallel file system.

From: already5chosen on
Matt,
Could you comment on Paul Gotch's speculations above (about MIPS
20Kc) ?
From: Nick Maclaren on

In article <aa530dfa-c87f-4569-a915-f23fd2be8548(a)k37g2000hsf.googlegroups.com>,
matt.reilly(a)sicortex.com writes:
|>
|> We've since added folks from a lot of different places. (But I can't
|> think of any
|> Apollo alums, though I don't have the whole list of folks at my
|> fingertips.)

Ah. My source got it wrong, then.


Regards,
Nick Maclaren.
From: Chris Thomasson on
<matt.reilly(a)sicortex.com> wrote in message
news:aa530dfa-c87f-4569-a915-f23fd2be8548(a)k37g2000hsf.googlegroups.com...
> I'll try to catch up on a few of the questions and conjectures:
>
> 1. The SiCortex founders (Jud Leonard, John Mucci, and I) had worked
> before at
> Digital (all three), Thinking Machines (John), Symbolics (Jud) and
> other places.
> We've since added folks from a lot of different places. (But I can't
> think of any
> Apollo alums, though I don't have the whole list of folks at my
> fingertips.)

A flood of talented people simply cannot hurt anyone!

:^D




> 2. The DMA engine microcode is published. The brave and willing can
> certainly
> use it as a starting place for new approaches.

Perfect.




> 3. The systems are designed with a focus on reliability, low power,
> deployability,
> and good price performance for applications that scale to hundreds or
> thousands
> of processors. (We also fit a few other profiles that go beyond this
> focus, but this
> is what we were thinking when we designed and built the product.)

IMHO, your systems seem to make a lot of sense.




> 4. We implement MPI because that is what the major part of our target
> markets
> requires. We do pretty well at it too. The MPI implementation (based
> on MPICH)
> talks to the DMA engine through a work-queue based interface. The DMA
> engine
> microcode also supports IP over the SiCortex fabric and a high
> performance
> communication path to support the Lustre parallel file system.

What do you think about my initial idea on how to program your systems? That
is, using advanced shared-memory multi-threading techniques for intra-node
communication, and MPI for inter-node communication... I think it should
work very well. I am always interested in being able to create and play
around with my own algorithms. I appreciate that your DMA engine microcode
is available; have you applied for any patents?

From: matt.reilly on
On Apr 6, 1:19 pm, already5cho...(a)yahoo.com wrote:
> Matt,
> Could you comment on Paul Gotch's speculations above (about MIPS
> 20Kc) ?

We looked at the 20Kc and liked it. However, it was a hard macro
(that is, MIPS supplies completed masks, not synthesizable verilog)
and designed for 130nm. Our technology target was 90nm. The team
had lots of experience in doing design shrinks and felt the cost
of shrinking the 20Kc was prohibitive. Further, the 20Kc as it stood
was not designed for a cache coherent SMP. Fitting the necessary
changes into a hard macro made it even more problematic.

We chose the 5Kf, a 64 bit soft IP block from MIPS. Then we
worked hard at the synthesis flow to make it run at 500 MHz
and stay within a sub-watt power budget.

matt
First  |  Prev  |  Next  |  Last
Pages: 1 2 3 4 5 6 7 8 9
Prev: Committed Instructions
Next: Need of "Precise Exceptions"