2024 SPARC FAIR Codeathon

The SPARC Data and Resource Center, with funding from the National Institutes of Health (NIH), will host a three-day in-the-cloud codeathon.

Event Details

Updated at: 06/10/2024

2024 SPARC FAIR Codeathon

Pitch your project: [Click Here]

Project Pitch closes Monday, June 17.


Project seed ideas

  • Development of connectivity visualization/interaction tools (e.g. using cytoscape widgets in Jupyter; D3 in Javascript). Potential projects are not limited to but could involve:

    • Connectivity matrix visualizations
    • Dendrogram
    • Organ coverage
    • Species contribution
    • Connectivity paths
    • Export of visualizations for documentation/presentations
    • Exploration of neuron populations
  • Simple Protocols.io widget that provides a quick overview of the protocol that could be integrated with the portal.

  • Extensible tools for information extraction from datasets (e.g. csv spreadsheets, json data) for the population of a quantitative database

  • Use the SPARC Python Client to perform complex queries on the SPARC Portal (e.g. finding datasets containing certain files types) and reuse that data. See an example tutorial here

  • Turn your code/model into a module for the o²S²PARC platform, by using the cookiecutter o²S²PARC service library (examples will be provided). Bonus: use SPARC data to build your model or analysis pipeline.

  • Derive computational models from the vagus scaffold to explore electrical conductivity using o²S²PARC S4Llite https://sparc.science/news-and-events/news/5U9a8F2TgWKDiFH6Cyw51i

  • Implement a more refined and/or visual user-friendly query interface for finding and navigating datasets and/or dataset files

  • Create a tiff image viewer component for viewing .tiff files on the portal

  • Allow a user to select what metadata gets displayed for dataset results and provide the ability to sort the results based off different metadata properties

  • Use augmented reality to create a new experience of exploring scaffolds with annotations for linking to datasets , e.g. exporting scaffolds to GL Transmission Format.

  • Encoding experimental data into a Scaffold VTK export for visualisation/simulation on o²S²PARC.


FAIR Codeathon FAQs:

Who can participate? Teams will greatly benefit from people who possess any of the following skills:

  • Data mining, image and text analysis
  • Working knowledge of scripting languages (e.g., Shell, Python, R)
  • Familiarity with methods for manipulating and/or analyzing large datasets (AI/ML, computational modeling, etc.)
  • Developing bioinformatics code, pipelines or tools
  • Data visualization
  • Knowledge graphs
  • Web development
  • Understanding of neuroscience and/or neurostimulation

Do I have to lead a team? You can choose to lead your project team, recommend someone, or we can try to find a suitable team lead. Providing a designated team lead dramatically increases the probability that we will select the project for the codeathon.

Sign up to be informed when to join a team: [Click Here]

Do I need to assemble a team? No. We will create working groups of five to six individuals who have various backgrounds and relevant expertise to work on each project.

What are my responsibilities as a team lead? The team leader will coordinate a group of 5-6 people in defining the project and producing clear vision for developing a solution. To accomplish this goal, the team lead must define and delegate tasks, incorporate team members’ ideas to accomplish the goal, and ensure the team’s success.

What if I only want to participate? Applications for those who would like to participate in the codeathon will be available in late June. Here is the link to sign up.

What will a typical day be like? We will meet regularly as a group throughout the codeathon, exact timing to be determined as teams are formed. SPARC Data & Resource Center teams will host regular office hours enabling teams to seek guidance.

What will we build? We will make all pipelines, other scripts, software, and programs generated in this codeathon available on a dedicated public GitHub organisation.

Teams may submit manuscripts describing the design and use of the software tools they created to an appropriate journal such as the F1000Research hackathons channel, GigaScience, or PLoS Computational Biology.

What is FAIR? The volume of publicly available data continues to rise exponentially, but the capacity for fully employing this data is being hampered by a series of limitations. FAIR is a very powerful initiative that has taken root worldwide. The initiative has the potential to significantly increase the value of life science data sets. While the concept shares some commonality with the semantic web, FAIR data goes further to expand opportunities for knowledge-sharing and value. Here are four foundation papers on this exploding field:

More information about SPARC:

There are several tools and resources available that may be leveraged when designing the projects. Codeathon projects should result in code, tools (see SODA for an example), datasets, or other outputs which are open and freely available.

Investigators are publishing lots of FAIR data from a range of species, and spanning all the major visceral organs and peripheral nerves, as they seek to better understand the autonomic nervous system. The data being collected includes microscopy, electrophysiological and mechanical time series, single-cell RNASeq, functional MRI, and more.

SPARC data is highly curated, ensuring the data is published in a FAIR manner and backed by a semantic knowledge base. SPARC data is further enriched by being mapped to 2D “flatmaps” that enable visual exploration of the topological anatomy of the peripheral nervous system. Where possible, data is also mapped to 3D organ scaffolds to provide a common coordinate system enabling comparison across subjects, species, and protocols as well as interactive environments to aid understanding and interpretation of SPARC data. The open-source tools for mapping data to scaffolds are available.

One special data resource, called “simulations”, consists of computational models and data analysis pipelines. These simulations can be run on o²S²PARC, which was designed to host, modularize, and ensure the reproducibility of simulations contributed by researchers. This is achieved by archiving contributed simulation code along with the code’s execution environment with versioning.

SPARC data, simulations, and maps are published on the SPARC Portal, an open-source platform for finding, exploring, visualizing, interacting with, and accessing SPARC data and associated computational models and analyses.


Legal
Participants retain ownership of all intellectual property rights (including moral rights) to the code submitted to as well as developed in the codeathon. Employees of the U.S. Government attending as part of their official duties retain no copyright to their work and their work is in the public domain in the U.S. The Government disclaims any rights to the code submitted or developed in the codeathon. Participants agree to publish the code and any related data on GitHub.

Please feel free to contact the FAIR Codeathon team if you have questions or need more information: fair-codeathon-support@sparc.science.