CUDA Tile Open Sourced

(github.com)

83 points | by JonChesterfield 6 days ago

5 comments

  • toolboxg1x0 8 minutes ago
    NVIDIA tensor core units, where the second column in kernel optimization is producing a test suite.
  • xmorse 31 minutes ago
    Writing this in Mojo would have been so much easier
    • 3abiton 24 minutes ago
      It's barely gaining adoption though. The lack of buzz is a chicken and egg issue for Mojo. I fiddled shortly with it (mainly to get it working some of my pythong scripts), and it was suprisingly easy. It'll shoot up one day for sure if Latner doesn't give up early on it.
  • boywitharupee 16 minutes ago
    shouldn't the title be "CUDA Tile IR Open Sourced"?
  • CamperBob2 43 minutes ago
    Fun game: see how many clicks it takes you to learn what MLIR stands for.

    I lost count at five or six. Define your acronyms on first use, people.

    • roughly 8 minutes ago
      The ol’ TMA problem.
    • piskov 6 minutes ago
      If only there was a chat-based app that you could ask questions to.
    • fragmede 36 minutes ago
      I did it in three. I selected it in your comment, and then had to hit "more" to get to the menu to ask Google about it, which brought me to https://www.google.com/search?q=MLIR which says: MLIR is an open-source compiler infrastructure project developed as a sub-project of the LLVM project. Hopefully

      Get better at computers stop needing to be spoon-fed information, people!

      • reactordev 22 minutes ago
        In this day and age, asking questions about what something is is a minefield of “just ask AI” and “You should know this”. Let’s stop putting down people who ask questions and root out those that have shitty answers.
      • poita66 19 minutes ago
        And yet you didn’t tell us what it stands for, just what it is. The person you’re responding to was specifically talking about finding out what it stands for
  • jauntywundrkind 1 hour ago
    Will be interesting to see if Nvidia and other have any interest & energy getting this used by others, if there actually is an ecosystem forming around it.

    Google leading XLA & IREE, with awesome intermediate representations, used by lots of hardware platforms, and backing really excellent Jax & Pytorch implementations, having tools for layout & optinization folks can share: they really build an amazing community.

    There's still so much room for planning/scheduling, so much hardware we have yet to target. RISC-V has really interesting vector instructions, for example, and it seems like there's so much exploration / work to do to better leverage that.

    Nvidia has partners everywhere now. Nvlink is used by Intel, AWS Tritanium, others. Yesterday the Groq exclusive license that Nvidia paid to give to Groq?! Seeing how and when CUDA Tiles emerges: will be interesting. Moving from fabric partnerships, up up up the stack.

    • pjmlp 57 minutes ago
      For NVidia it suffices this is a Python JIT allowing programming CUDA compute kernels directly in Python instead of C++, yet another way how Intel and AMD, alongside Khronos APIs, lag behind in great developer experiences for GPU compute programming.

      Ah, and Nsight debugging also supports Python CUDA Tiles debugging.

      https://developer.nvidia.com/blog/simplify-gpu-programming-w...

    • Moosdijk 1 hour ago
      > There's still so much room for planning/scheduling, so much hardware we have yet to target

      this is nicely illustrated by this recent article:

      https://news.ycombinator.com/item?id=46366998

    • turtletontine 1 hour ago
      On the RISC-V vector instructions, could you elaborate? Are the vector extensions substantially different from those in ARM or x86?
      • adgjlsfhk1 48 minutes ago
        it's fairly similar to Arm's sve2, but very different from the x86 side in that the instructions are variable length rather than fixed