In this tutorial, we will perform an integrated analysis of ATAC-seq and RNA-seq during mouse development. The original analysis was published in Zhang, 2019.

This tutorial requires following additional software:

  • samtools
  • MACS2
  • bwa
  • bedGraphToBigWig

Preparing input

We will download the data from the ENCODE portal. First add the IDs of files to the input.tsv, and associate each record with the ENCODE tag. We will download and analyze raw FASTQ file for ATAC-seq experiments. For RNA-seq experiments, post-processed gene quantifications are available to download. So we will use those as our input, Taiji is capable of analyzing the raw FASTQ for RNA-seq as well though. To indicate that the files are gene quantifications, add GeneQuant tags to those files.

input.tsv:

type	id	group	rep	path	tags
ATAC-seq	ENCSR204ZTY	limb_E15.5	1	ENCFF629FRA,ENCFF222KJL	ENCODE
ATAC-seq	ENCSR204ZTY	limb_E15.5	2	ENCFF488EFS,ENCFF917DTZ	ENCODE
ATAC-seq	ENCSR150RMQ	embryonic facial prominence_E11.5	2	ENCFF741MLW,ENCFF460ETY	ENCODE
ATAC-seq	ENCSR150RMQ	embryonic facial prominence_E11.5	1	ENCFF667CEX,ENCFF165UKX	ENCODE
ATAC-seq	ENCSR358MOW	embryonic facial prominence_E13.5	2	ENCFF104GCO,ENCFF737HFT	ENCODE
ATAC-seq	ENCSR358MOW	embryonic facial prominence_E13.5	1	ENCFF166ANK,ENCFF512CMK	ENCODE
ATAC-seq	ENCSR983JWA	neural tube_E15.5	1	ENCFF847BCI,ENCFF608AWT	ENCODE
ATAC-seq	ENCSR983JWA	neural tube_E15.5	2	ENCFF551SAG,ENCFF166QOA	ENCODE
ATAC-seq	ENCSR623GSD	hindbrain_E16.5	1	ENCFF568THX,ENCFF196ILN	ENCODE
ATAC-seq	ENCSR623GSD	hindbrain_E16.5	2	ENCFF950API,ENCFF946KOT	ENCODE
ATAC-seq	ENCSR088UYE	hindbrain_E12.5	1	ENCFF954MPR,ENCFF824PVC	ENCODE
ATAC-seq	ENCSR088UYE	hindbrain_E12.5	2	ENCFF199WNL,ENCFF968KUR	ENCODE
ATAC-seq	ENCSR618HDK	stomach_E14.5	2	ENCFF395TZD,ENCFF395IGM	ENCODE
ATAC-seq	ENCSR618HDK	stomach_E14.5	1	ENCFF699GSF,ENCFF944CWK	ENCODE
ATAC-seq	ENCSR961SMM	intestine_E15.5	2	ENCFF867QVP,ENCFF771QNR	ENCODE
ATAC-seq	ENCSR961SMM	intestine_E15.5	1	ENCFF674OOP,ENCFF775MXU	ENCODE
ATAC-seq	ENCSR261ICG	embryonic facial prominence_E15.5	1	ENCFF724ZGT,ENCFF312YEC	ENCODE
ATAC-seq	ENCSR261ICG	embryonic facial prominence_E15.5	2	ENCFF139WZE,ENCFF322LEO	ENCODE
ATAC-seq	ENCSR154BXN	midbrain_E12.5	1	ENCFF680STO,ENCFF476NYE	ENCODE
ATAC-seq	ENCSR154BXN	midbrain_E12.5	2	ENCFF194BBQ,ENCFF867CGI	ENCODE
ATAC-seq	ENCSR903GMO	forebrain_E13.5	1	ENCFF401VUV,ENCFF898NRO	ENCODE
ATAC-seq	ENCSR903GMO	forebrain_E13.5	2	ENCFF721LGJ,ENCFF777UKE	ENCODE
ATAC-seq	ENCSR810HQR	forebrain_E14.5	1	ENCFF048MTG,ENCFF890LGM	ENCODE
ATAC-seq	ENCSR810HQR	forebrain_E14.5	2	ENCFF633MTW,ENCFF666DRJ	ENCODE
ATAC-seq	ENCSR836PUC	forebrain_E16.5	2	ENCFF058IAE,ENCFF765HUX	ENCODE
ATAC-seq	ENCSR836PUC	forebrain_E16.5	1	ENCFF776GDQ,ENCFF588XZG	ENCODE
ATAC-seq	ENCSR211OCS	midbrain_P0	2	ENCFF713VTW,ENCFF166GCN	ENCODE
ATAC-seq	ENCSR211OCS	midbrain_P0	1	ENCFF470CPP,ENCFF073BZO	ENCODE
ATAC-seq	ENCSR310MLB	forebrain_P0	2	ENCFF197GTC,ENCFF209GGJ	ENCODE
ATAC-seq	ENCSR310MLB	forebrain_P0	1	ENCFF296GZG,ENCFF664RZO	ENCODE
ATAC-seq	ENCSR363SKQ	stomach_E16.5	1	ENCFF310DVL,ENCFF545MGJ	ENCODE
ATAC-seq	ENCSR363SKQ	stomach_E16.5	2	ENCFF836EJZ,ENCFF528WNO	ENCODE
ATAC-seq	ENCSR377YDY	limb_E11.5	1	ENCFF156CTY,ENCFF507XBJ	ENCODE
ATAC-seq	ENCSR377YDY	limb_E11.5	2	ENCFF332GCF,ENCFF672BYU	ENCODE
ATAC-seq	ENCSR551WBK	limb_E12.5	1	ENCFF698GFS,ENCFF110KAU	ENCODE
ATAC-seq	ENCSR551WBK	limb_E12.5	2	ENCFF637TUL,ENCFF283VKJ	ENCODE
ATAC-seq	ENCSR700QBR	neural tube_E14.5	2	ENCFF726JSJ,ENCFF461GHH	ENCODE
ATAC-seq	ENCSR700QBR	neural tube_E14.5	1	ENCFF111RFS,ENCFF945IYI	ENCODE
ATAC-seq	ENCSR312LQX	hindbrain_P0	2	ENCFF971XEA,ENCFF215WAD	ENCODE
ATAC-seq	ENCSR312LQX	hindbrain_P0	1	ENCFF547YID,ENCFF131VHT	ENCODE
ATAC-seq	ENCSR559FAJ	forebrain_E12.5	1	ENCFF413XTH,ENCFF119TXW	ENCODE
ATAC-seq	ENCSR559FAJ	forebrain_E12.5	2	ENCFF199UBT,ENCFF171APM	ENCODE
ATAC-seq	ENCSR302LIV	liver_E12.5	2	ENCFF772GVP,ENCFF987EVS	ENCODE
ATAC-seq	ENCSR302LIV	liver_E12.5	1	ENCFF409BPW,ENCFF016UWL	ENCODE
ATAC-seq	ENCSR876SYO	embryonic facial prominence_E14.5	2	ENCFF185IWQ,ENCFF050LTK	ENCODE
ATAC-seq	ENCSR876SYO	embryonic facial prominence_E14.5	1	ENCFF369BHN,ENCFF043JPE	ENCODE
ATAC-seq	ENCSR597BGP	stomach_P0	1	ENCFF756RRU,ENCFF907CZS	ENCODE
ATAC-seq	ENCSR597BGP	stomach_P0	2	ENCFF550FMX,ENCFF601QOX	ENCODE
ATAC-seq	ENCSR966ORC	intestine_E16.5	2	ENCFF278MAA,ENCFF898FIK	ENCODE
ATAC-seq	ENCSR966ORC	intestine_E16.5	1	ENCFF880CTO,ENCFF896RTV	ENCODE
ATAC-seq	ENCSR102NGD	lung_P0	1	ENCFF705BQG,ENCFF269YOD	ENCODE
ATAC-seq	ENCSR102NGD	lung_P0	2	ENCFF370OFA,ENCFF150BVC	ENCODE
ATAC-seq	ENCSR552ABC	heart_E13.5	1	ENCFF100SXY,ENCFF064NKM	ENCODE
ATAC-seq	ENCSR552ABC	heart_E13.5	2	ENCFF406EUS,ENCFF559NSG	ENCODE
ATAC-seq	ENCSR603MWL	heart_E15.5	2	ENCFF385WYM,ENCFF051GLX	ENCODE
ATAC-seq	ENCSR603MWL	heart_E15.5	1	ENCFF829XFO,ENCFF694SPD	ENCODE
ATAC-seq	ENCSR335VJW	lung_E14.5	2	ENCFF204EVA,ENCFF437GNH	ENCODE
ATAC-seq	ENCSR335VJW	lung_E14.5	1	ENCFF979DGG,ENCFF244BWB	ENCODE
ATAC-seq	ENCSR819QOJ	midbrain_E13.5	2	ENCFF667HBE,ENCFF574MEU	ENCODE
ATAC-seq	ENCSR819QOJ	midbrain_E13.5	1	ENCFF659NJR,ENCFF703JZE	ENCODE
ATAC-seq	ENCSR668EIA	lung_E15.5	1	ENCFF507MHY,ENCFF217WJG	ENCODE
ATAC-seq	ENCSR668EIA	lung_E15.5	2	ENCFF159YPF,ENCFF940GZL	ENCODE
ATAC-seq	ENCSR798FDL	hindbrain_E14.5	2	ENCFF868CEL,ENCFF266VLC	ENCODE
ATAC-seq	ENCSR798FDL	hindbrain_E14.5	1	ENCFF673MIM,ENCFF657XRO	ENCODE
ATAC-seq	ENCSR652CNN	heart_E12.5	2	ENCFF719RSO,ENCFF086MTT	ENCODE
ATAC-seq	ENCSR652CNN	heart_E12.5	1	ENCFF377YCK,ENCFF982JWB	ENCODE
ATAC-seq	ENCSR150EOO	intestine_E14.5	1	ENCFF642UHQ,ENCFF188JXF	ENCODE
ATAC-seq	ENCSR150EOO	intestine_E14.5	2	ENCFF593JRX,ENCFF243SQC	ENCODE
ATAC-seq	ENCSR732OTZ	kidney_E16.5	1	ENCFF413HET,ENCFF021GRI	ENCODE
ATAC-seq	ENCSR732OTZ	kidney_E16.5	2	ENCFF573ZOR,ENCFF326PPN	ENCODE
ATAC-seq	ENCSR785NEL	liver_E11.5	1	ENCFF288CVJ,ENCFF888ZZV	ENCODE
ATAC-seq	ENCSR785NEL	liver_E11.5	2	ENCFF883SEZ,ENCFF035OMK	ENCODE
ATAC-seq	ENCSR068YGC	heart_E14.5	1	ENCFF826YDW,ENCFF258GFE	ENCODE
ATAC-seq	ENCSR068YGC	heart_E14.5	2	ENCFF753WMG,ENCFF031SEH	ENCODE
ATAC-seq	ENCSR976LWP	forebrain_E15.5	2	ENCFF248PXW,ENCFF825UHO	ENCODE
ATAC-seq	ENCSR976LWP	forebrain_E15.5	1	ENCFF906VXU,ENCFF500SXI	ENCODE
ATAC-seq	ENCSR273UFV	forebrain_E11.5	1	ENCFF419LDW,ENCFF963YIU	ENCODE
ATAC-seq	ENCSR273UFV	forebrain_E11.5	2	ENCFF083TDB,ENCFF680UAR	ENCODE
ATAC-seq	ENCSR662KNY	hindbrain_E15.5	2	ENCFF213OUA,ENCFF709CLT	ENCODE
ATAC-seq	ENCSR662KNY	hindbrain_E15.5	1	ENCFF761JHA,ENCFF181OVJ	ENCODE
ATAC-seq	ENCSR460BUL	limb_E14.5	1	ENCFF672EYN,ENCFF981BBK	ENCODE
ATAC-seq	ENCSR460BUL	limb_E14.5	2	ENCFF948NPO,ENCFF634JLT	ENCODE
ATAC-seq	ENCSR896XIN	limb_E13.5	1	ENCFF508LJX,ENCFF103PCA	ENCODE
ATAC-seq	ENCSR896XIN	limb_E13.5	2	ENCFF732HZX,ENCFF506NVN	ENCODE
ATAC-seq	ENCSR384JBF	midbrain_E14.5	1	ENCFF769SYG,ENCFF948TDP	ENCODE
ATAC-seq	ENCSR384JBF	midbrain_E14.5	2	ENCFF227ATX,ENCFF307UMX	ENCODE
ATAC-seq	ENCSR217NOA	neural tube_E13.5	1	ENCFF336EQO,ENCFF308AMG	ENCODE
ATAC-seq	ENCSR217NOA	neural tube_E13.5	2	ENCFF843JWW,ENCFF577DNF	ENCODE
ATAC-seq	ENCSR371KFW	heart_E16.5	1	ENCFF416BZL,ENCFF473REG	ENCODE
ATAC-seq	ENCSR371KFW	heart_E16.5	2	ENCFF304CCF,ENCFF778FWU	ENCODE
ATAC-seq	ENCSR032HKE	liver_E14.5	1	ENCFF159HYY,ENCFF911HQX	ENCODE
ATAC-seq	ENCSR032HKE	liver_E14.5	2	ENCFF618OJP,ENCFF863FZP	ENCODE
ATAC-seq	ENCSR079GOY	intestine_P0	2	ENCFF169QUZ,ENCFF742CFJ	ENCODE
ATAC-seq	ENCSR079GOY	intestine_P0	1	ENCFF039HAM,ENCFF780FQW	ENCODE
ATAC-seq	ENCSR820ACB	heart_E11.5	1	ENCFF279LMU,ENCFF820PVO	ENCODE
ATAC-seq	ENCSR820ACB	heart_E11.5	2	ENCFF823XXU,ENCFF518FYP	ENCODE
ATAC-seq	ENCSR023QZX	kidney_E15.5	2	ENCFF395EDU,ENCFF100YMT	ENCODE
ATAC-seq	ENCSR023QZX	kidney_E15.5	1	ENCFF340WOK,ENCFF117TPW	ENCODE
ATAC-seq	ENCSR096JCC	midbrain_E16.5	2	ENCFF518FPU,ENCFF754FRO	ENCODE
ATAC-seq	ENCSR096JCC	midbrain_E16.5	1	ENCFF458XRA,ENCFF882GWB	ENCODE
ATAC-seq	ENCSR468GUI	midbrain_E15.5	2	ENCFF267LVT,ENCFF347QTV	ENCODE
ATAC-seq	ENCSR468GUI	midbrain_E15.5	1	ENCFF187YXW,ENCFF865ZYW	ENCODE
ATAC-seq	ENCSR758IRM	kidney_E14.5	2	ENCFF802JNF,ENCFF576WIS	ENCODE
ATAC-seq	ENCSR758IRM	kidney_E14.5	1	ENCFF958YUR,ENCFF504EBV	ENCODE
ATAC-seq	ENCSR451NAE	heart_P0	2	ENCFF913PMS,ENCFF483MKX	ENCODE
ATAC-seq	ENCSR451NAE	heart_P0	1	ENCFF655OFT,ENCFF999SZR	ENCODE
ATAC-seq	ENCSR282YTE	neural tube_E11.5	2	ENCFF187AQG,ENCFF994STU	ENCODE
ATAC-seq	ENCSR282YTE	neural tube_E11.5	1	ENCFF529IDC,ENCFF149EJI	ENCODE
ATAC-seq	ENCSR690VOH	neural tube_E12.5	1	ENCFF226GOU,ENCFF444RRA	ENCODE
ATAC-seq	ENCSR690VOH	neural tube_E12.5	2	ENCFF927BDD,ENCFF303PPN	ENCODE
ATAC-seq	ENCSR382RUC	midbrain_E11.5	1	ENCFF747ZCB,ENCFF236CXJ	ENCODE
ATAC-seq	ENCSR382RUC	midbrain_E11.5	2	ENCFF315TMY,ENCFF752YLN	ENCODE
ATAC-seq	ENCSR343TXK	liver_E13.5	2	ENCFF360MVK,ENCFF443PRW	ENCODE
ATAC-seq	ENCSR343TXK	liver_E13.5	1	ENCFF382CMV,ENCFF688ZFD	ENCODE
ATAC-seq	ENCSR465PYP	liver_E15.5	1	ENCFF329VCX,ENCFF290ZBP	ENCODE
ATAC-seq	ENCSR465PYP	liver_E15.5	2	ENCFF341HRL,ENCFF489XAT	ENCODE
ATAC-seq	ENCSR486XAS	stomach_E15.5	1	ENCFF997AUJ,ENCFF033YBG	ENCODE
ATAC-seq	ENCSR486XAS	stomach_E15.5	2	ENCFF211VPG,ENCFF442KCP	ENCODE
ATAC-seq	ENCSR609OHJ	liver_P0	1	ENCFF599TJR,ENCFF176IZG	ENCODE
ATAC-seq	ENCSR609OHJ	liver_P0	2	ENCFF957VLH,ENCFF999IJT	ENCODE
ATAC-seq	ENCSR389CLN	kidney_P0	1	ENCFF171RXE,ENCFF144HIW	ENCODE
ATAC-seq	ENCSR389CLN	kidney_P0	2	ENCFF763EEB,ENCFF765QFH	ENCODE
ATAC-seq	ENCSR176BYZ	hindbrain_E13.5	2	ENCFF331IGF,ENCFF409GTD	ENCODE
ATAC-seq	ENCSR176BYZ	hindbrain_E13.5	1	ENCFF071AXB,ENCFF628CBN	ENCODE
ATAC-seq	ENCSR031HDN	embryonic facial prominence_E12.5	1	ENCFF265RFZ,ENCFF547OQI	ENCODE
ATAC-seq	ENCSR031HDN	embryonic facial prominence_E12.5	2	ENCFF492KHX,ENCFF459CPD	ENCODE
ATAC-seq	ENCSR012YAB	hindbrain_E11.5	2	ENCFF753PKM,ENCFF635ZQL	ENCODE
ATAC-seq	ENCSR012YAB	hindbrain_E11.5	1	ENCFF058UWO,ENCFF163UHB	ENCODE
ATAC-seq	ENCSR627OCR	lung_E16.5	1	ENCFF577XAL,ENCFF224TAO	ENCODE
ATAC-seq	ENCSR627OCR	lung_E16.5	2	ENCFF872IZI,ENCFF427HTC	ENCODE
ATAC-seq	ENCSR255XTC	liver_E16.5	2	ENCFF702NAP,ENCFF243ROW	ENCODE
ATAC-seq	ENCSR255XTC	liver_E16.5	1	ENCFF894ZND,ENCFF788EQO	ENCODE
RNA-seq	ENCSR691OPQ	heart_E11.5	1	ENCFF226IWR	ENCODE,GeneQuant
RNA-seq	ENCSR691OPQ	heart_E11.5	2	ENCFF540EJL	ENCODE,GeneQuant
RNA-seq	ENCSR908JWT	midbrain_E12.5	2	ENCFF887JHQ	ENCODE,GeneQuant
RNA-seq	ENCSR908JWT	midbrain_E12.5	1	ENCFF399CQH	ENCODE,GeneQuant
RNA-seq	ENCSR420QTO	hindbrain_E12.5	2	ENCFF242PFZ	ENCODE,GeneQuant
RNA-seq	ENCSR420QTO	hindbrain_E12.5	1	ENCFF983XDK	ENCODE,GeneQuant
RNA-seq	ENCSR504GEG	kidney_E14.5	1	ENCFF499WRT	ENCODE,GeneQuant
RNA-seq	ENCSR504GEG	kidney_E14.5	2	ENCFF413OJO	ENCODE,GeneQuant
RNA-seq	ENCSR851HEC	embryonic facial prominence_E12.5	1	ENCFF594CEM	ENCODE,GeneQuant
RNA-seq	ENCSR851HEC	embryonic facial prominence_E12.5	2	ENCFF742JLO	ENCODE,GeneQuant
RNA-seq	ENCSR401BSG	hindbrain_E15.5	1	ENCFF395LAH	ENCODE,GeneQuant
RNA-seq	ENCSR401BSG	hindbrain_E15.5	2	ENCFF338ZXD	ENCODE,GeneQuant
RNA-seq	ENCSR448MXQ	liver_E13.5	1	ENCFF615ZTQ	ENCODE,GeneQuant
RNA-seq	ENCSR448MXQ	liver_E13.5	2	ENCFF336VTP	ENCODE,GeneQuant
RNA-seq	ENCSR466KZY	stomach_E16.5	1	ENCFF288JNN	ENCODE,GeneQuant
RNA-seq	ENCSR466KZY	stomach_E16.5	2	ENCFF052DOQ	ENCODE,GeneQuant
RNA-seq	ENCSR367ZPZ	midbrain_E16.5	1	ENCFF918YFP	ENCODE,GeneQuant
RNA-seq	ENCSR367ZPZ	midbrain_E16.5	2	ENCFF052VJO	ENCODE,GeneQuant
RNA-seq	ENCSR290RRR	stomach_E14.5	1	ENCFF050PAT	ENCODE,GeneQuant
RNA-seq	ENCSR290RRR	stomach_E14.5	2	ENCFF691EQW	ENCODE,GeneQuant
RNA-seq	ENCSR823VEE	embryonic facial prominence_E14.5	1	ENCFF924CMS	ENCODE,GeneQuant
RNA-seq	ENCSR823VEE	embryonic facial prominence_E14.5	2	ENCFF370UDF	ENCODE,GeneQuant
RNA-seq	ENCSR370SFB	intestine_E15.5	1	ENCFF052THP	ENCODE,GeneQuant
RNA-seq	ENCSR370SFB	intestine_E15.5	2	ENCFF114YCL	ENCODE,GeneQuant
RNA-seq	ENCSR830IVQ	limb_E15.5	1	ENCFF532ZDE	ENCODE,GeneQuant
RNA-seq	ENCSR830IVQ	limb_E15.5	2	ENCFF003DBZ	ENCODE,GeneQuant
RNA-seq	ENCSR537GNQ	kidney_E16.5	1	ENCFF752QKG	ENCODE,GeneQuant
RNA-seq	ENCSR537GNQ	kidney_E16.5	2	ENCFF143OJZ	ENCODE,GeneQuant
RNA-seq	ENCSR347SQR	limb_E13.5	1	ENCFF358WYS	ENCODE,GeneQuant
RNA-seq	ENCSR347SQR	limb_E13.5	2	ENCFF634AUL	ENCODE,GeneQuant
RNA-seq	ENCSR020DGG	heart_E16.5	1	ENCFF415JBI	ENCODE,GeneQuant
RNA-seq	ENCSR020DGG	heart_E16.5	2	ENCFF871IGQ	ENCODE,GeneQuant
RNA-seq	ENCSR284YKY	heart_E13.5	1	ENCFF242GMD	ENCODE,GeneQuant
RNA-seq	ENCSR284YKY	heart_E13.5	2	ENCFF976CYB	ENCODE,GeneQuant
RNA-seq	ENCSR597UZW	heart_E15.5	1	ENCFF440PWB	ENCODE,GeneQuant
RNA-seq	ENCSR597UZW	heart_E15.5	2	ENCFF219PVC	ENCODE,GeneQuant
RNA-seq	ENCSR538WYL	embryonic facial prominence_E13.5	1	ENCFF132NQU	ENCODE,GeneQuant
RNA-seq	ENCSR538WYL	embryonic facial prominence_E13.5	2	ENCFF867TKM	ENCODE,GeneQuant
RNA-seq	ENCSR062VTB	kidney_E15.5	1	ENCFF700YRC	ENCODE,GeneQuant
RNA-seq	ENCSR062VTB	kidney_E15.5	2	ENCFF347HOK	ENCODE,GeneQuant
RNA-seq	ENCSR457RRW	lung_E15.5	1	ENCFF718PJC	ENCODE,GeneQuant
RNA-seq	ENCSR457RRW	lung_E15.5	2	ENCFF996EMC	ENCODE,GeneQuant
RNA-seq	ENCSR727FHP	heart_E14.5	1	ENCFF111IGW	ENCODE,GeneQuant
RNA-seq	ENCSR727FHP	heart_E14.5	2	ENCFF540BJT	ENCODE,GeneQuant
RNA-seq	ENCSR750YSX	limb_E12.5	1	ENCFF879FXB	ENCODE,GeneQuant
RNA-seq	ENCSR750YSX	limb_E12.5	2	ENCFF470WZZ	ENCODE,GeneQuant
RNA-seq	ENCSR611PTP	liver_E15.5	1	ENCFF740SXP	ENCODE,GeneQuant
RNA-seq	ENCSR611PTP	liver_E15.5	2	ENCFF504YJB	ENCODE,GeneQuant
RNA-seq	ENCSR921PRX	hindbrain_E13.5	1	ENCFF131WIM	ENCODE,GeneQuant
RNA-seq	ENCSR921PRX	hindbrain_E13.5	2	ENCFF604LWF	ENCODE,GeneQuant
RNA-seq	ENCSR992WBR	lung_E16.5	1	ENCFF538RNR	ENCODE,GeneQuant
RNA-seq	ENCSR992WBR	lung_E16.5	2	ENCFF365SND	ENCODE,GeneQuant
RNA-seq	ENCSR080EVZ	forebrain_E16.5	1	ENCFF590FAC	ENCODE,GeneQuant
RNA-seq	ENCSR080EVZ	forebrain_E16.5	2	ENCFF484AOO	ENCODE,GeneQuant
RNA-seq	ENCSR285WZV	hindbrain_E16.5	1	ENCFF211ELX	ENCODE,GeneQuant
RNA-seq	ENCSR285WZV	hindbrain_E16.5	2	ENCFF830YBR	ENCODE,GeneQuant
RNA-seq	ENCSR932TRU	intestine_E14.5	1	ENCFF959OHE	ENCODE,GeneQuant
RNA-seq	ENCSR932TRU	intestine_E14.5	2	ENCFF228SAS	ENCODE,GeneQuant
RNA-seq	ENCSR647QBV	forebrain_E12.5	1	ENCFF804FTJ	ENCODE,GeneQuant
RNA-seq	ENCSR647QBV	forebrain_E12.5	2	ENCFF601JPN	ENCODE,GeneQuant
RNA-seq	ENCSR752RGN	forebrain_E15.5	1	ENCFF763GXJ	ENCODE,GeneQuant
RNA-seq	ENCSR752RGN	forebrain_E15.5	2	ENCFF340XFQ	ENCODE,GeneQuant
RNA-seq	ENCSR792RJV	midbrain_E13.5	1	ENCFF422BJI	ENCODE,GeneQuant
RNA-seq	ENCSR792RJV	midbrain_E13.5	2	ENCFF196WAD	ENCODE,GeneQuant
RNA-seq	ENCSR115TWD	neural tube_E13.5	1	ENCFF502BTV	ENCODE,GeneQuant
RNA-seq	ENCSR115TWD	neural tube_E13.5	2	ENCFF049EIV	ENCODE,GeneQuant
RNA-seq	ENCSR557RMA	midbrain_E15.5	1	ENCFF706XGJ	ENCODE,GeneQuant
RNA-seq	ENCSR557RMA	midbrain_E15.5	2	ENCFF835FSF	ENCODE,GeneQuant
RNA-seq	ENCSR648YEP	liver_E12.5	1	ENCFF746VZM	ENCODE,GeneQuant
RNA-seq	ENCSR648YEP	liver_E12.5	2	ENCFF468PFF	ENCODE,GeneQuant
RNA-seq	ENCSR508GWZ	neural tube_E12.5	1	ENCFF353TCZ	ENCODE,GeneQuant
RNA-seq	ENCSR508GWZ	neural tube_E12.5	2	ENCFF224SRI	ENCODE,GeneQuant
RNA-seq	ENCSR848GST	intestine_E16.5	1	ENCFF443JRH	ENCODE,GeneQuant
RNA-seq	ENCSR848GST	intestine_E16.5	2	ENCFF278PAQ	ENCODE,GeneQuant
RNA-seq	ENCSR906YQZ	stomach_E15.5	1	ENCFF972NMO	ENCODE,GeneQuant
RNA-seq	ENCSR906YQZ	stomach_E15.5	2	ENCFF355MOU	ENCODE,GeneQuant
RNA-seq	ENCSR636CWO	embryonic facial prominence_E15.5	1	ENCFF369TLJ	ENCODE,GeneQuant
RNA-seq	ENCSR636CWO	embryonic facial prominence_E15.5	2	ENCFF536XKZ	ENCODE,GeneQuant
RNA-seq	ENCSR004XCU	neural tube_E15.5	1	ENCFF037GWJ	ENCODE,GeneQuant
RNA-seq	ENCSR004XCU	neural tube_E15.5	2	ENCFF365DLM	ENCODE,GeneQuant
RNA-seq	ENCSR970EWM	forebrain_E13.5	1	ENCFF567AFL	ENCODE,GeneQuant
RNA-seq	ENCSR970EWM	forebrain_E13.5	2	ENCFF227HKF	ENCODE,GeneQuant
RNA-seq	ENCSR826HIQ	liver_E16.5	1	ENCFF759PUL	ENCODE,GeneQuant
RNA-seq	ENCSR826HIQ	liver_E16.5	2	ENCFF512KYX	ENCODE,GeneQuant
RNA-seq	ENCSR928OXI	neural tube_E14.5	1	ENCFF513HAL	ENCODE,GeneQuant
RNA-seq	ENCSR928OXI	neural tube_E14.5	2	ENCFF967SJG	ENCODE,GeneQuant
RNA-seq	ENCSR284AMY	liver_E11.5	1	ENCFF954EHG	ENCODE,GeneQuant
RNA-seq	ENCSR284AMY	liver_E11.5	2	ENCFF523MEO	ENCODE,GeneQuant
RNA-seq	ENCSR331XCE	intestine_P0	1	ENCFF485CJB	ENCODE,GeneQuant
RNA-seq	ENCSR331XCE	intestine_P0	2	ENCFF795XBQ	ENCODE,GeneQuant
RNA-seq	ENCSR160IIN	forebrain_E11.5	1	ENCFF465SNB	ENCODE,GeneQuant
RNA-seq	ENCSR160IIN	forebrain_E11.5	2	ENCFF976OLT	ENCODE,GeneQuant
RNA-seq	ENCSR719NAJ	midbrain_P0	1	ENCFF210MWH	ENCODE,GeneQuant
RNA-seq	ENCSR719NAJ	midbrain_P0	2	ENCFF793WMU	ENCODE,GeneQuant
RNA-seq	ENCSR096STK	liver_P0	1	ENCFF875HIA	ENCODE,GeneQuant
RNA-seq	ENCSR096STK	liver_P0	2	ENCFF143HKK	ENCODE,GeneQuant
RNA-seq	ENCSR337FYI	neural tube_E11.5	1	ENCFF375JDR	ENCODE,GeneQuant
RNA-seq	ENCSR337FYI	neural tube_E11.5	2	ENCFF298WHK	ENCODE,GeneQuant
RNA-seq	ENCSR982MRY	lung_P0	1	ENCFF990GIB	ENCODE,GeneQuant
RNA-seq	ENCSR982MRY	lung_P0	2	ENCFF658LQM	ENCODE,GeneQuant
RNA-seq	ENCSR307BCA	midbrain_E11.5	1	ENCFF359ZOA	ENCODE,GeneQuant
RNA-seq	ENCSR307BCA	midbrain_E11.5	2	ENCFF971KZC	ENCODE,GeneQuant
RNA-seq	ENCSR559TRB	hindbrain_E14.5	1	ENCFF206KRT	ENCODE,GeneQuant
RNA-seq	ENCSR559TRB	hindbrain_E14.5	2	ENCFF741FZD	ENCODE,GeneQuant
RNA-seq	ENCSR362AIZ	forebrain_P0	1	ENCFF918QNL	ENCODE,GeneQuant
RNA-seq	ENCSR362AIZ	forebrain_P0	2	ENCFF895JXR	ENCODE,GeneQuant
RNA-seq	ENCSR526SEX	heart_P0	1	ENCFF817KPY	ENCODE,GeneQuant
RNA-seq	ENCSR526SEX	heart_P0	2	ENCFF155GNG	ENCODE,GeneQuant
RNA-seq	ENCSR216NEG	limb_E14.5	1	ENCFF677BPV	ENCODE,GeneQuant
RNA-seq	ENCSR216NEG	limb_E14.5	2	ENCFF794QMH	ENCODE,GeneQuant
RNA-seq	ENCSR848HOX	embryonic facial prominence_E11.5	1	ENCFF772UWT	ENCODE,GeneQuant
RNA-seq	ENCSR848HOX	embryonic facial prominence_E11.5	2	ENCFF262TXH	ENCODE,GeneQuant
RNA-seq	ENCSR760TOE	hindbrain_E11.5	1	ENCFF750FTK	ENCODE,GeneQuant
RNA-seq	ENCSR760TOE	hindbrain_E11.5	2	ENCFF109HTF	ENCODE,GeneQuant
RNA-seq	ENCSR185LWM	forebrain_E14.5	1	ENCFF745ZJF	ENCODE,GeneQuant
RNA-seq	ENCSR185LWM	forebrain_E14.5	2	ENCFF816CVP	ENCODE,GeneQuant
RNA-seq	ENCSR178GUS	stomach_P0	1	ENCFF517XLT	ENCODE,GeneQuant
RNA-seq	ENCSR178GUS	stomach_P0	2	ENCFF434XNZ	ENCODE,GeneQuant
RNA-seq	ENCSR017JEG	hindbrain_P0	1	ENCFF798MSP	ENCODE,GeneQuant
RNA-seq	ENCSR017JEG	hindbrain_P0	2	ENCFF861GUP	ENCODE,GeneQuant
RNA-seq	ENCSR867YNV	liver_E14.5	1	ENCFF432ZGG	ENCODE,GeneQuant
RNA-seq	ENCSR867YNV	liver_E14.5	2	ENCFF572OPZ	ENCODE,GeneQuant
RNA-seq	ENCSR173PJN	kidney_P0	1	ENCFF905HUL	ENCODE,GeneQuant
RNA-seq	ENCSR173PJN	kidney_P0	2	ENCFF783LVC	ENCODE,GeneQuant
RNA-seq	ENCSR541XZK	limb_E11.5	1	ENCFF195JHC	ENCODE,GeneQuant
RNA-seq	ENCSR541XZK	limb_E11.5	2	ENCFF457ZGF	ENCODE,GeneQuant
RNA-seq	ENCSR039ADS	lung_E14.5	1	ENCFF872ABP	ENCODE,GeneQuant
RNA-seq	ENCSR039ADS	lung_E14.5	2	ENCFF226ILJ	ENCODE,GeneQuant
RNA-seq	ENCSR343YLB	midbrain_E14.5	1	ENCFF453GZG	ENCODE,GeneQuant
RNA-seq	ENCSR343YLB	midbrain_E14.5	2	ENCFF870YWY	ENCODE,GeneQuant
RNA-seq	ENCSR150CUE	heart_E12.5	2	ENCFF343NNZ	ENCODE,GeneQuant
RNA-seq	ENCSR150CUE	heart_E12.5	1	ENCFF345NEQ	ENCODE,GeneQuant

config.yml:

input: "input.tsv"
output_dir: "output/"
assembly: "mm10"

Running the analysis

This is a large dataset containing 264 experiments. It is not feasible to analyze such a large dataset on a personal desktop. Nowadays large-scale computing usually happens in the cloud. So in this tutorial I will show you how to use Taiji in a HPC cluster.

First you need to have an access to a HPC cluster that supports slurm or PBS like workload manager. Now, put following lines in your config.yml file:

submit_params: "-q home -l walltime=10:00:00"
submit_command: "qsub"
submit_cpu_format: "-l nodes=1:ppn=%d"
submit_memory_format: "-l mem=%dG

This configuration works for The Triton Shared Computing Cluster (TSCC) at UCSD. You may need to make adjustment for your local environment.

The submission parameters of individual step are configurable as well:

resource:
  RNA_Align:
    parameter: "-q home -l walltime=24:00:00"
    memory: 50

  ATAC_Align:
    memory: 10

Once you have your config.yml file ready, run the analysis using:

taiji run --config config.yml --cloud

Results

Here are some of the QC metrics:

Number of Reads

TSS enrichment

Fragment size distribution

Taiji combines motif scanning, network inference and the PageRank algorithm to rank TFs. This result will be saved in the GeneRank.tsv file. There is also a GeneRank.html file that you can visualize.

TF Ranking scores (showing 639 TFs across 66 tissues)