Optimizing Data Flow for Scientific Discovery – Information Sciences Institute

By Taaza Facts

Published on:

Optimizing Data Flow for Scientific Discovery – Information Sciences Institute

No one can precisely predict when an earthquake is to happen. Since the 1994 6.7-magnitude Northridge earthquake in Los Angeles County that caused 72 deaths, 9,000 injuries and $25 billion in damages, Southern California has been anxiously waiting for “The Big One”: a devastating quake predicted to be at least a 7.8 magnitude and 44 times stronger. Seismologists can only say that it may happen within the next 30 years.

Earthquake Calculations

Although scientists cannot forecast when and where earthquakes will strike, preparation is key to improving society’s resilience to large earthquakes. In particular, the USC-based Statewide California Earthquake Center (SCEC) developed CyberShake, a computational platform that simulates hundreds of thousands of earthquakes to calculate regional seismic hazard models. Revealing geographical areas in Southern California most at risk for intense shaking, its results have influenced Los Angeles building codes and the design of the earthquake models at the U.S. Geological Survey, the nation’s largest earth and geological science mapping agency.

CyberShake studies—and much of modern science, however—are highly data and computing-intensive. With multi-step calculations that feed into numerous interconnected computational tasks executing on local and national supercomputers to simulate 600,000 different earthquakes, CyberShake’s scientific workflow is complex. USC Viterbi’s Information Sciences Institute (ISI) houses the tools to generate and manage such massive data.

Ewa Deelman, a research professor in computer science and research director at ISI, has continuously designed and updated, since 2000, an automated workflow management system called Pegasus.

Optimized Workflows

Pegasus—named after Planning for Execution and Grids (PEG) and Deelman’s love for horses—turns research experiments into optimized workflows. It can be used by scientists in various fields from seismology to physics to bioinformatics because of its abstract design. Deelman likens it to a cooking recipe: “You can use the same recipe in different kitchens. Different users can run the recipe (the workflow) but with their own cookware (computational resources). When you design things in a broad enough way, they become widely applicable.” In 2016, scientists from the Laser Interferometer Gravitational-Wave Observatory (LIGO) utilized Pegasus to capture gravitational waves in the universe, confirming Albert Einstein’s General Theory of Relativity and earning the 2017 Nobel Prize for physics. During the 16-year collaboration between ISI computer scientists and LIGO members, the software managed thousands of workflows with millions of tasks.

The Collaborative and Adaptive Sensing of the Atmosphere (CASA), an engineering research center dedicated to improving hazardous weather prediction and response, has also ported its pipelines into Pegasus. As severe weather can slow and compromise local resources and computing capacity, the program sends CASA’s data into cloud infrastructures to ensure continuous workflow.

Inspired by Animal Behaviors

CyberShake has relied on Pegasus for the past 15 years, including its most recent study with its largest set of earthquake simulations yet. Pegasus managed 2.5 petabytes of data and ran 28,120 workflow jobs over 108 days to produce seismic hazard maps in 772,000 node-hours.

“Without Pegasus, there’s no way we’d be able to do this kind of science,” said Scott Callaghan, a computer scientist at SCEC and lead developer on CyberShake. SCEC will be expanding CyberShake to Northern California, now using the fastest supercomputer in the world, Frontier. Pegasus will continue to remain at their side. “Every time we do one of these studies, we always encounter unexpected challenges. But I’m confident that, with any workflow issues, the Pegasus team will be able to help us work through them so that we can continue getting cutting-edge science done,” Callaghan said.

Deelman is now conducting research and conceptualizing SWARM, another workflow management system inspired by the savvy coordination of group behaviors amongst social animals, like ants. She also plans to enhance Pegasus’ decision-making with artificial intelligence, reimagining how workflow systems will operate in the future.

Published on May 13th, 2024

Last updated on May 16th, 2024

Taaza Facts

I am a multifaceted content creator with expertise in blogging, Finance, and Cryptocurrency reviews. My creative journey involves weaving captivating stories in blogs, designing aesthetically pleasing and functional websites, and dissecting the nuances of cinema. We are dedicated to sharing our passion and insights with a global audience.

Related Post

Freshers Jobs Vacancy – Data Scientist 1 Job Opening at Swiggy

Swiggy is hiring for the role of Data Scientist 1 for the Remote location. Please read the entire information carefully and apply if you are eligible for the Data ...

Trojans celebrate resilience and being ‘Free to Live Out Loud’ at virtual Juneteenth event

When the Rev. Brandon Harris took the stage to open Wednesday’s Juneteenth celebration, he shared with the audience of more than 150 students, faculty, alumni and ...

Freshers Jobs Vacancy – Developer Job Opening at Barclays

Barclays is hiring for the role of  Developer for the Pune location. Please read the complete information carefully and apply if you are eligible for a Developer Job Opening ...

Freshers Jobs Vacancy – Junior Data Analyst Job Opening at BCG

Boston Consulting Group is hiring for Junior Data Analyst for New Delhi location. Interested people can read the entire information carefully and apply if you are eligible Junior Data ...

Leave a Comment

Actor John Abraham Purchases Bungalow In Mumbai For Rs 75.83 Crore Ankita Lokhande: From TV Stardom to Bollywood Dreams | Inspiring Journey Ayesha Khan Lifestyle, Bigg Boss 17 Wild Card, Boyfriend, Family, Munawar faruqui, Age, Biography Kalki 2898 AD – 100 Times Bigger Than Salaar Prabhas Film Bigg Boss 17: Salman Khan scolded Ayesha badly, got angry on Munawar-Ayesha matter