“The ability to step through individual processes in addition to aggregations of processors is useful, as is the fact that breakpoints are saved when a job exits. The developer working on large hydrodynamics code lauds TotalView’s breakpoint management capabilities, as well as its ability to scale transparently up to thousands of processors or processes while remaining easy to use. Also, being able to set conditional breaks has been helpful.” He says, “TotalView has been extremely helpful as part of my development process in finding bugs.I like the fact that I can look at all jobs in parallel and really see what is going on - on all processors at the same time. The advanced debugging capabilities of TotalView, including independent thread control, multi-platform support, register and instruction level debugging, and a built-in memory debugger, have been proven to reduce development time in some areas by more than 20 percent.įor the LLNL programmer developing scalable linear solvers, using TotalView has yielded great benefits. TotalView is the most proven scalable debugging product of its kind, able to handle from one to thousands of processes. LLNL developers use the TotalView debugger to understand and reduce the complexity of developing applications on Blue Gene/L. Other applications developed on Blue Gene/L at LLNL include a large multiphysics code written in C, which runs on a variety of platforms and has been used for scaling studies up to 12,000 processors. The goal is scalability across a large number of processors, which is difficult since the algorithms require a large amount of communication across many processors. “On the other hand, the debugger needs to work fairly quickly at this scale to be of real use.”Īnother programmer is developing scalable linear solvers, mainly algebraic multi-grid, written in C and consisting of short programs designed to be used by bigger applications. ![]() “We don’t always have the luxury of scaling back to 2048 processors,” he said. It will soon be able to run on all 12,800 processors of the Blue Gene/L machine.”įor this developer, the biggest challenge is debugging his code that crashes when scaled on a very large number of processors. It incorporates many different third-party libraries and thus must embrace a large number of coding styles with different language features used. It has a variety of network communication patterns that are dynamic over time. It compiles to a 37-MB executable on Blue Gene/L when optimized. It is a mixture of C, C++, Fortran and Fortran 90. One developer at LLNL described his application: “The code is a large, highly portable hydrodynamics code. Applications written for the IBM Blue Gene/L are highly complex, using thousands of processors and consuming gigabytes of memory, and developing efficient code for such an advanced supercomputer presents great challenges for developers.Īpplications being developed range from “simple” scalable linear solvers to large hydrodynamic and simulation codes that use multiple languages and network communication patterns. Researchers at LLNL develop mission-critical Grand Challenge applications using the IBM Blue Gene/L, one of the fastest supercomputers in the world. ![]() Solving Application Development Challenges Blue Gene/L is also used to explore the potential of system-on-a-chip technologies to achieve extreme speed while minimizing floor space and electrical power consumption. Given the critical role that LLNL plays in national security, it is no surprise that they utilize the world’s fastest supercomputers, the IBM Blue Gene series, to develop their mission-critical applications.Īt LLNL, Blue Gene/L is optimized to run molecular dynamics applications at extreme speeds to address materials aging issues confronting the Stockpile Stewardship Program. With its special capabilities, the laboratory also meets other pressing national and international security needs, such as countering the threat of nuclear proliferation and terrorism, strengthening homeland security, and enhancing military effectiveness. As a national security laboratory, Lawrence Livermore National Laboratory (LLNL) is responsible for ensuring that the nation’s nuclear weapons remain safe, secure and reliable by applying the latest advances in science and engineering.
0 Comments
Leave a Reply. |