AIW i2b2 ETL

AIW i2b2 ETL is a Protempa query results handler for loading data and abstractions into i2b2.

License	License Apache License, Version 2.0
Categories	Categories CLI User Interface Eureka Container Microservices
GroupId	GroupId org.eurekaclinical
ArtifactId	ArtifactId aiw-i2b2-etl
Last Version	Last Version 4.7
Release Date	Release Date 12-Jan-2021
Type	Type jar
Description	Description AIW i2b2 ETL AIW i2b2 ETL is a Protempa query results handler for loading data and abstractions into i2b2.
Project URL	Project URL https://github.com/eurekaclinical/aiw-i2b2-etl.git
Project Organization	Project Organization Emory University
Source Code Management	Source Code Management https://github.com/eurekaclinical/aiw-i2b2-etl.git

Download aiw-i2b2-etl

Filename	Size
aiw-i2b2-etl-4.7.pom
aiw-i2b2-etl-4.7.jar	405 KB
aiw-i2b2-etl-4.7-sources.jar	297 KB
aiw-i2b2-etl-4.7-javadoc.jar	606 KB
Browse

How to add to project

Apache Maven

<!-- https://jarcasting.com/artifacts/org.eurekaclinical/aiw-i2b2-etl/ -->
<dependency>
    <groupId>org.eurekaclinical</groupId>
    <artifactId>aiw-i2b2-etl</artifactId>
    <version>4.7</version>
</dependency>

Gradle Groovy

// https://jarcasting.com/artifacts/org.eurekaclinical/aiw-i2b2-etl/
implementation 'org.eurekaclinical:aiw-i2b2-etl:4.7'

Gradle Kotlin

// https://jarcasting.com/artifacts/org.eurekaclinical/aiw-i2b2-etl/
implementation ("org.eurekaclinical:aiw-i2b2-etl:4.7")

Apache Buildr

'org.eurekaclinical:aiw-i2b2-etl:jar:4.7'

Apache Ivy

<dependency org="org.eurekaclinical" name="aiw-i2b2-etl" rev="4.7">
  <artifact name="aiw-i2b2-etl" type="jar" />
</dependency>

Groovy Grape

@Grapes(
@Grab(group='org.eurekaclinical', module='aiw-i2b2-etl', version='4.7')
)

Scala SBT

libraryDependencies += "org.eurekaclinical" % "aiw-i2b2-etl" % "4.7"

Leiningen

[org.eurekaclinical/aiw-i2b2-etl "4.7"]

Dependencies

compile (8)

Group / Artifact	Type	Version
org.eurekaclinical : protempa-framework	jar	5.2-Alpha-2
org.eurekaclinical : etl-common	jar	0.0.5
org.apache.commons : commons-lang3	jar	3.9
org.eurekaclinical : protempa-dsb-relationaldb	jar	5.2-Alpha-2
org.apache.commons : commons-csv	jar	1.7
com.sun.xml.bind : jaxb-core	jar	2.3.0.1
javax.xml.bind : jaxb-api	jar	2.3.1
com.sun.xml.bind : jaxb-impl	jar	2.3.1

test (8)

Group / Artifact	Type	Version
junit : junit	jar	4.12
org.dbunit : dbunit	jar	2.6.0
org.eurekaclinical : protempa-bconfigs-ini4j-ini	jar	5.2-Alpha-2
org.apache.commons : commons-dbcp2	jar	2.7.0
com.h2database : h2	jar	1.4.193
org.liquibase : liquibase-core	jar	3.6.3
org.apache.tomcat : tomcat-catalina	jar	9.0.26
org.eurekaclinical : eurekaclinical-ontology	jar	2.0.5

Project Modules

There are no modules declared in this project.

Protempa i2b2 Tools

Department of Biomedical Informatics, Emory University, Atlanta, GA

What does it do?

This project provides the ability to read data and concepts from an i2b2 deployment, and write data and computed temporal sequences to the same or a different i2b2 deployment. I2b2 is a data warehousing technology for biomedical research. Specifically, this project provides:

a Protempa destination, edu.emory.cci.aiw.i2b2etl.dest.I2b2Destination, for loading data into an i2b2 database. Protempa destinations implement the org.protempa.dest.Destination interface and process output from the temporal abstraction process.
a Protempa data source backend, org.emory.cci.aiw.i2b2etl.dsb.I2b2DataSourceBackend, for reading data from an i2b2 database. Protempa data source backends implement the org.protempa.dsb.DataSourceBackend interface.
a Protempa knowledge source backend, org.emory.cci.aiw.i2b2etl.dsb.I2b2KnowledgeSourceBackend, for fetching clinical concepts from an i2b2 ontology cell with a modified schema. Protempa knowledge source backends implement the org.protempa.ksb.KnowledgeSourceBackend interface.

See the Protempa project's README for more details on Protempa's architecture.

Latest release:

Version 4.0

Updated Protempa version requirements.

Version 3.0.1

Depend on a range of Protempa versions that are backward compatible.

Version 3.0

Version 3 is primarily focused around performance improvements.

Build requirements

Runtime requirements

Building it

The project uses the maven build tool. Typically, you build it by invoking mvn clean install at the command line. For simple file changes, not additions or deletions, you can usually use mvn install. See https://github.com/eurekaclinical/dev-wiki/wiki/Building-Eureka!-Clinical-projects for more details.

Maven dependency

<dependency>
    <groupId>org.eurekaclinical</groupId>
    <artifactId>aiw-i2b2-etl</artifactId>
    <version>version</version>
</dependency>

Installation

Put the aiw-i2b2-etl jarfile and its dependencies in the classpath, and Protempa will automatically register the data source backend and knowledge source backend.

Additional i2b2 destination installation

The i2b2 destination requires adding tables and stored procedures to the i2b2 data schema. A Liquibase changelog contains the DDL. The following files contain the SQL for creating the stored procedures:

Oracle
- Package declaration
- Package implementation
PostgreSQL
- Stored procedures

The i2b2 destination requires adding stored procedures to the i2b2 metadata schema. The following files contain the SQL for creating the stored procedures:

Oracle
- Package declaration
- Package implementation
PostgreSQL
- Stored procedures

Additional i2b2 knowledge source backend installation

See the eurekaclinical-ontology project's README for how to create an i2b2 metadata schema that the i2b2 knowledge source backend can read. Eureka! Clinical requires some extensions to the schema that i2b2 ships out of the box. The ontology project also contains Liquibase changelogs that install various common terminologies into such an i2b2 metadata schema.

Using it

Backend configuration

`edu.emory.cci.aiw.i2b2etl.ksb.I2b2KnowledgeSourceBackend`

databaseApi: DRIVERMANAGER or DATASOURCE depending on whether the databaseId property contains a JDBC URL or a JNDI URL, respectively.
databaseId: a JDBC URL or a JNDI URL for connecting to the i2b2 metadata schema.
username: for JDBC URLs, a database username.
password: for JDBC URLs, a database password.
targetTable: the name of the metadata table that contains computed temporal sequences.

`edu.emory.cci.aiw.i2b2etl.dsb.I2b2DataSourceBackend`

databaseAPI: DRIVERMANAGER or DATASOURCE depending on whether the databaseId property contains a JDBC URL or a JNDI URL, respectively.
databaseId: a JDBC URL or a JNDI URL for connecting to the i2b2 data schema.
username: for JDBC URLs, a database username.
password: for JDBC URLs, a database password.
schemaName: the name of the data schema to query. If specified, the schema name will be included in queries. If not specified, no schema name will be specified, and the schema that will be queried will be database dependent. Typically, the database will query the user's default schema.

Examples

import org.protempa.SourceFactory;
import org.protempa.backend.Configurations;
import org.protempa.bconfigs.ini4j.INIConfigurations;
import org.protempa.Protempa;
import org.protempa.dest.Destination;
import org.protempa.dest.map.MapDestination;
import org.protempa.query.DefaultQueryBuilder;
import org.protempa.query.Query;
import edu.emory.cci.aiw.i2b2etl.dest.I2b2Destination;
import edu.emory.cci.aiw.i2b2etl.dest.config.Configuration;

// An implementation of org.protempa.backend.Configurations provides the backends to use.
Configurations backends = new INIConfigurations(new File("src/test/resources"));
SourceFactory sourceFactory = new SourceFactory(backends.load("protempa-config.ini"));

// Use try-with-resources to ensure resources are cleaned up.
try (Protempa protempa = Protempa.newInstance(sourceFactory)) {
    DefaultQueryBuilder q = new DefaultQueryBuilder();
    q.setName("My test query");
    q.setPropositionIds(new String[]{"ICD9:Diagnoses", "ICD9:Procedures", "LAB:LabTest", "Encounter", "MED:medications", "VitalSign",     
        "PatientDetails"}); // an array of the concept ids of the data to retrieve and/or temporal patterns to compute
    Query query = protempa.buildQuery(q);

    // An implementation of org.protempa.dest.Destination processes output from the temporal abstraction process.
    Configuration i2b2Config = //implementation of edu.emory.cci.aiw.i2b2etl.dest.config.Configuration
    Destination dest = new I2b2Destination(i2b2Config); 
    protempa.execute(query, dest);
}

The protempa-config.ini might contain the following sections for configuring the i2b2 data source backend and knowledge source backend:

[edu.emory.cci.aiw.i2b2etl.dsb.I2B2DataSourceBackend]
dataSourceBackendId=Unique identifier for this i2b2 repository
databaseId = JDBC URL for the data schema
username = username with privileges to read the data schema
password = data schema user's password
schemaName = name of the data schema

[edu.emory.cci.aiw.i2b2etl.ksb.I2b2KnowledgeSourceBackend]
databaseAPI = DATASOURCE # any setters in the implementation that have the @BackendProperty annotation.
databaseId = java:/comp/env/jdbc/I2b2KS
targetTable = EUREKAPHENOTYPEONTOLOGY

[org.protempa.backend.asb.java.JavaAlgorithmBackend]

The i2b2 data and knowledge source backends read data directly from i2b2's database schemas rather than through its web services APIs for performance. Similarly, the i2b2 Protempa destination loads data into i2b2 directly into its data schema rather than through its web services APIs for performance.

Developer documentation

Javadoc for latest development release

Eureka! Clinical

Microservices for clinical and translational research

Versions

Version
4.7 12-Jan-2021
4.6 12-Aug-2020
4.4 04-Dec-2019
4.3 02-Dec-2019
4.2 31-Oct-2019
4.0 21-Sep-2018
4.0-Beta-8 07-Aug-2018
4.0-Beta-7 03-Aug-2018
4.0-Beta-6 30-Jul-2018
4.0-Beta-5 29-Jul-2018
4.0-Beta-4 26-Jul-2018
4.0-Beta-3 26-Jul-2018
4.0-Beta-2 25-Jul-2018
4.0-Beta-1 25-Jul-2018
4.0-Alpha-2 14-Jul-2018
4.0-Alpha-1 10-Jul-2018
3.0.1 28-Jun-2018
3.0 16-Feb-2018
3.0-Beta-2 11-Feb-2018
3.0-Beta-1 11-Feb-2018
3.0-Alpha-24 10-Feb-2018
3.0-Alpha-23 03-Feb-2018
3.0-Alpha-22 19-Aug-2017
3.0-Alpha-21 19-Aug-2017
3.0-Alpha-20 03-Aug-2017
3.0-Alpha-19 05-Jun-2017
3.0-Alpha-18 26-May-2017
3.0-Alpha-17 26-May-2017
3.0-Alpha-16 05-Apr-2017
3.0-Alpha-15 05-Apr-2017
3.0-Alpha-14 04-Apr-2017
3.0-Alpha-13 24-Feb-2017
3.0-Alpha-12 14-Feb-2017
3.0-Alpha-11 01-Feb-2017
3.0-Alpha-10 01-Feb-2017
3.0-Alpha-9 31-Jan-2017
3.0-Alpha-8 30-Jan-2017
3.0-Alpha-7 23-Jan-2017
3.0-Alpha-6 21-Jan-2017
3.0-Alpha-5 18-Jan-2017
3.0-Alpha-4 17-Jan-2017
3.0-Alpha-3 17-Jan-2017
3.0-Alpha-2 17-Jan-2017
3.0-Alpha-1 17-Jan-2017
2.4.1 14-Feb-2017
2.4 12-Feb-2017
2.4-Beta-1 11-Feb-2017
2.3.1-Beta-1 17-Jan-2017
2.3 18-Nov-2016
2.2.2 18-Nov-2016
2.2.1 14-Nov-2016
2.2 04-Nov-2016
2.1.2 27-Sep-2016
2.1.1 27-Sep-2016
2.1 26-Sep-2016
2.0 16-Sep-2016
2.0-Alpha-40 10-Sep-2016
2.0-Alpha-39 04-Sep-2016
2.0-Alpha-38 02-Sep-2016
2.0-Alpha-37 02-Sep-2016
2.0-Alpha-36 02-Sep-2016
2.0-Alpha-35 18-Aug-2016
2.0-Alpha-34 18-Aug-2016
2.0-Alpha-33 17-Aug-2016
2.0-Alpha-32 17-Aug-2016
2.0-Alpha-31 11-Aug-2016
2.0-Alpha-30 04-Aug-2016
2.0-Alpha-29 18-Jul-2016
2.0-Alpha-28 07-Jul-2016
2.0-Alpha-27 05-Jul-2016
2.0-Alpha-26 02-Jul-2016
2.0-Alpha-25 30-Jun-2016
2.0-Alpha-24 28-Jun-2016
2.0-Alpha-23 18-May-2016
2.0-Alpha-22 05-Apr-2016
2.0-Alpha-21 05-Apr-2016
2.0-Alpha-20 04-Mar-2016
2.0-Alpha-19 02-Mar-2016
2.0-Alpha-18 02-Mar-2016
2.0-Alpha-17 27-Feb-2016
2.0-Alpha-16 26-Feb-2016
2.0-Alpha-15 20-Feb-2016
2.0-Alpha-14 20-Feb-2016
2.0-Alpha-13 20-Feb-2016
2.0-Alpha-12 19-Feb-2016
2.0-Alpha-11 18-Feb-2016
2.0-Alpha-10 18-Feb-2016
2.0-Alpha-9 17-Feb-2016
2.0-Alpha-8 16-Feb-2016
2.0-Alpha-7 15-Feb-2016
2.0-Alpha-6 11-Feb-2016
2.0-Alpha-5 10-Dec-2015

AIW i2b2 ETL

License

Categories

GroupId

ArtifactId

Last Version

Release Date

Type

Description

Project URL

Project Organization

Source Code Management