Why Scientific Workflows?

Scientific workflows are a flexible representation to declaratively express applications with data and control dependencies, and are mainstream in domains such as astronomy, physics, climate science, earthquake science, biology, and others. Learn more.

 

Our Vision

The SciTech research group aims to empower the scientific community by tightening the relations between the domain scientists and current computational resources. As a result, scientists can focus on their research questions, while our open-source tools provide the computational foundations to seamlessly run their experiments and analyses in local and distributed resources.

Automation

We provide tools to automate and advance scientific analyses from the scientist's desktop to clouds and world-class supercomputers. Learn more.

Reproducibility

We enable computational reproducible research via data provenance, and data and software preservation mechanisms. Learn more.

Big Data

Our tools automatically manage large volumes of data transfers between different computational resources and data repositories. Learn more.

Data Science

We provide tools to collect fine-grained performance data from experiments and the computational environment. Learn more.

Latest News

Seminar: Scheduling and Memory Management for Large-Scale Applications: From Caches to Burst Buffers
with No Comments

Abstract: This talk explores scheduling problems in the context of large-scale applications from a memory perspective. We focus here on two very different levels of memory in the hierarchy: caches and burst buffers.With the recent advent of many-core architectures such as chip multiprocessors … Read More

Seminar: DISTRIBUTE HIGH THROUGHPUT COMPUTING AT WORK, AN OSG STATUS REPORT
with No Comments

Abstract: For more than 15 years, the Open Science Grid (OSG) has been offering the science community a fabric of distributed High Throughput Computing (dHTC) services. In close collaboration with science and campus communities as well as resource and software … Read More

Seminar: defoe: A Spark-based Toolbox for Analysing Digital Historical Textual Data
with No Comments

In this talk will present defoe, a new scalable and portable digital toolbox that enables historical research. It allows for extracting knowledge from historical data by running text analyses across large digital collections, such as historical newspapers and books in parallel. … Read More

Seminar: The Cyberinfrastructure of Gravitational-wave Astronomy and March towards LIGO Open Data
with No Comments

The discovery of gravitational waves by LIGO and Virgo has been a revolution event in astronomy and physics. In this talk, I will discuss some of the cyberinfrastructure that is used to explore the universe with gravitational waves, including: the … Read More

Seminar: Co-scheduling for large-scale applications: memory and resilience
with No Comments

This talk explores co-scheduling problems in the context of large-scale applications with two main focus: the memory side, in particular the cache memory and the resilience side. With the recent advent of many-core architectures such as chip multiprocessors (CMP), the … Read More

Contributions of two SciTech’s DARPA-funded projects featured in the DARPA 60th Anniversary Digital Magazine
with No Comments

For the past 3 years the SciTech research group (led by Dr. Ewa Deelman) has enabled research endeavors via two DARPA-funded projects: RACE and MINT. The RACE (Repository and Workflows for Accelerating Circuit Realization) project is part of the DARPA CRAFT … Read More

Seminar: Checkpointing Workflows for Fail-Stop Error
with No Comments

We consider the problem of orchestrating the execution of workflow applications structured as Directed Acyclic Graphs (DAGs) on parallel computing platforms that are subject to fail-stop failures. The objective is to minimize expected overall execution time, or makespan. A solution … Read More

Job Opening: 2 positions for Programmer Analyst II
with No Comments

The SciTech group does research and development on software systems to help scientists manage large-scale computations. We work with scientists in domains ranging from genomics and proteomics to seismology and gravitational wave physics. We help scientists deploy their computations on … Read More