Chromatin accessibility and transcriptome landscapes of Monomorium pharaonis brain

Wang, Mingyue; Liu, Yang; Wen, Tinggang; Liu, Weiwei; Gao, Qionghua; Zhao, Jie; Xiong, Zijun; Wang, Zhifeng; Jiang, Wei; Yu, Yeya; Wu, Liang; Yuan, Yue; Wei, Xiaoyu; Xu, Jiangshan; Cheng, Mengnan; Zhang, Pei; Li, Panyi; Hou, Yong; Yang, Huanming; Zhang, Guojie; Li, Qiye; Liu, Chuanyu; Liu, Longqi

doi:10.1038/s41597-020-0556-x

Data Descriptor
Open access
Published: 08 July 2020

Scientific Data volume 7, Article number: 217 (2020)

Abstract

The emergence of social organization (eusociality) is a major event in insect evolution. Although previous studies have investigated the mechanisms underlying caste differentiation and social behavior of eusocial insects including ants and honeybees, the molecular circuits governing sociality in these insects remain obscure. In this study, we profiled the transcriptome and chromatin accessibility of brain tissues in three Monomorium pharaonis ant castes: queens (including mature and un-mated queens), males and workers. We provide a comprehensive dataset including 16 RNA-sequencing and 16 assay for transposase accessible chromatin (ATAC)-sequencing profiles. We also demonstrate strong reproducibility of the datasets and have identified specific genes and open chromatin regions in the genome that may be associated with the social function of these castes. Our data will be a valuable resource for further studies of insect behaviour, particularly the role of brain in the control of eusociality.

Measurement(s)	mRNA • open_chromatin_region • brain
Technology Type(s)	RNA sequencing • ATAC-seq
Factor Type(s)	caste
Sample Characteristic - Organism	Monomorium pharaonis

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.12111210

Background & Summary

Eusocial insects have their societies based on caste polyphenism, where one or more queens are exclusively responsible for reproduction¹. In contrast, workers, the largest population in the colony, are almost sterile and responsible for supporting the entire community through their labor, including collecting food, maintaining the nest and feeding/protecting the newly hatched larvae². Eusociality in the hymenopteran insects has evolved 10 times independently^3,4. Understanding eusociality in insects is important not only from an evolutionary or environmental perspective but also because it may provide clues into the behavior traits of higher species including humans.

Genes differentially expressed across castes in the brains of insects contribute to social behavior development^5,6. Several studies have focused on the overlapping genes or pathways associated with the division of labor across different eusocial insect lineages and constructed a set of conserved gene regulatory networks^7,8. In this regard, one of the key hypotheses for evolution of eusociality emphasized the important role of a core toolkit of genes involved in highly conserved pathways, such as metabolism and reproduction^9,10. In addition, it is also widely accepted that certain single genes can play pivotal roles. For instance, increasing insulin-like peptide 2 (ilp2) levels can break larval suppression and induce a stable division of labor in Ooceraea biroi¹¹. Likewise, the neuropeptide corazonin inhibits the transition from worker to gamergate in Harpegnathos saltator¹². Alternatively, many other studies have recognized the importance of taxonomically restricted genes in the evolution of eusocial behavior and performed a systematic comparison of the participation degree of shared genes and taxonomically restricted genes in eusocial division of labor^13,14,15,16. Despite these relevant studies, the comprehensive lists of genes associated with eusocial behavior and their interrelationship are still unknown.

Besides gene expression, epigenetic regulation is also recognized as an important facet in the regulation of caste-specific behavior in insects. For example, histone modifications are critical regulators of caste determination in Camponotus floridanus, in which distinct histone H3K27ac patterns exist between castes¹⁷. Likewise, caste-specific behavior in C. floridanus can be reprogrammed by treatment with a small-molecule inhibitor of histone deacetylases, suggesting a regulatory role for histone acetylation in eusocial behavior plasticity¹⁸. The role of DNA methylation in caste determination has also been investigated in honeybees and, interestingly, some of the differentially methylated CpG sites correspond to regulatory regions of genes involved in metabolic pathways¹⁹. Additionally, distinct DNA methylation patterns in queen and worker larvae have been reported in another eusocial insect, the termite Zootermopsis nevadensis¹⁹.

Taken together, these reports suggested crucial roles of transcription and epigenetics in shaping caste differentiation and controlling social behavior in insects. However, a comprehensive dataset of both layers is still lacking, hampering further advances in the field of eusociality.

Here, we constructed the transcriptome and chromatin accessibility landscapes of brain tissue of Monomorium pharaonis (Fig. 1a), which is the most ubiquitous house ant in the world^20,21. Monomorium pharaonis has three adult castes (workers, queens and males), with the queen caste comprising unmated queens (gynes) and mature queens. These four adult groups possess distinct morphologies, lifespans and behaviors, making the species an ideal model to explore the molecular and neural regulatory mechanisms of eusociality²². We sequenced 32 samples from the four groups of ants (16 RNA-seq and 16 ATAC-seq, with four biological replicates per group). After data quality assessment and filtering, we obtained a total of 240 Gb of high-quality bases for the RNA-sequencing, with more than 95% Q20 bases and approximately 149 million reads per sample. For the ATAC-sequencing, we obtained a total of 170 Gb of high-quality bases, with approximately 106 million reads per sample.

Methods

Experimental design

Four adult groups (workers, gynes, males and queens) of Monomorium pharaonis were used for brain RNA-sequencing and ATAC-sequencing profiling. We collected eight brain samples per caste group to perform these assays. A total of 32 ant brains were used. Each brain was used as a biological sample for either the RNA-sequencing or ATAC-sequencing. The experimental design and analysis workflow are shown in Fig. 1.

Animals

All procedures related to animals in this study were approved by the Institutional Review Board on Ethics Committee of BGI (Permit No. FT 19046). Two colonies of Monomorium pharaonis were created from a source colony (MP-MQ064) that was collected in June 2016 from Xishuangbanna in Yunnan province, China. We pre-assigned about 200 workers, 10 queens and about 200 larvae in total to each colony. The age of the selected queens was unknown. The two colonies used in the study were created at the State Key Laboratory of Genetic Resource and Evolution, Kunming Institute of Zoology, and then sent to the China National Gene Bank, BGI-Shenzhen. Ants were maintained for eight weeks before sampling at a constant temperature of 25 °C and 50% humidity and were fed with mealworm¹². At the time of sampling, each colony contained about 200 workers and 10 queens; the two colonies contained 5 and 8 males and 7 and 9 gynes, respectively.

Brain collection and RNA extraction

Designated ants were picked out and anaesthetized in a dissection dish on ice. The ants were then washed with ethanol and PBS twice, and dissected in PBS on ice under the light microscope (OLYMPUS, SZX16). To perform the dissection, a pair of forceps held the ant body while another pair of forceps inserted into the mouth gripped the head cuticle of the ant. The head was gently pulled off and the body discarded. Head cuticle was then gently peeled off with the forceps and the brain was removed. After carefully removing the surrounding trachea and ocelli, the ant brain was placed into PBS with 1 U/mL RNase inhibitor. All ants were dissected using the same method except that ocelli removal was not required for the workers. Brain samples were then washed twice with 500 μl PBS. All samples were collected during daytime (9:00 to 16:00). Whole-brain RNA was extracted immediately after dissection using an RNeasy Mini Kit (Qiagen) and eluted with 10 μl of nuclease-free water (NF-water, Ambion). The total amounts of RNA were measured using an RNA HS Qubit (Invitrogen).

RNA-sequencing library construction

We applied an optimized Smart-seq2 method for RNA-sequencing library construction²³. For cDNA generation, the following premixed reagents were added to each tube of RNA sample: 5 μl of 10 μM oligo-dT primer (5ʹ-AAGCAGTGGTATCAACGCAGAGTACT₃₀VN-3ʹ, where “V” is “A”, “C” or “G”, and “N” is any base), 4.86 μl of 10 mM dNTP (New England Biolabs) and 0.5 μl of 40 U/μl RNase inhibitor (New England Biolabs). ERCC Spike-In (Ambion) was added to each tube according to the amount of RNA. The mix was then incubated at 72 °C for 3 minutes and quickly placed on ice. Afterward, 20 μl of first-strand synthesis mix was added, containing 8 μl of 5X first-strand buffer (Invitrogen), 2 μl of 100 mM dithiothreitol (DTT, Invitrogen), 2 μl of 200 U/μl SuperScript II Reverse Transcriptase (Invitrogen), 8 μl of 5 M Betaine (Sigma), 0.24 μl of 1 M MgCl₂ (Millipore) and 0.4 μl of 100 μM template switch oligo (TSO, Exiqon; 5ʹ-AAGCAGTGGTATCAACGCAGAGTACATrGrG+G-3ʹ, where “r” indicates a ribonucleic acid base and “+” indicates a locked nucleic acid base). RNA was reverse transcribed at 42 °C for 90 minutes, followed by 10 cycles of 50 °C for 2 minutes and 42 °C for 2 minutes, and a final incubation at 70 °C for 5 minutes to inactivate the reverse transcriptase. A cDNA amplification mix containing 50 μl of KAPA HiFi HotStart ReadyMix (KAPA Biosystems), 1 μl of 10 μM IS primer (5ʹ-AAGCAGTGGTATCAACGCAGAGT-3ʹ) and 9 μl of NF-water was then added. Amplification was carried out as follows: 98 °C for 3 minutes; 13 cycles of 98 °C for 20 seconds, 67 °C for 20 seconds and 72 °C for 6 minutes; and finally 72 °C for 5 minutes. Afterwards, the PCR product was purified using 1X AMPure XP beads (Beckman Coulter). We measured cDNA concentration with the Qubit dsDNA HS Assay Kit 3.0 (Invitrogen) and analyzed the size distribution with an HS DNA chip on a Bioanalyzer (Agilent).
Libraries were prepared using a fragmentation-based method²⁴. For each sample, 300 ng of cDNA was sheared with NEBNext dsDNA Fragmentase (New England Biolabs). Fragmented DNA was then purified and end-repaired, adapters were added, and the product was amplified and size-selected. Afterwards, the library size distribution was assessed with an HS DNA chip on a Bioanalyzer; fragment lengths ranged from 300 to 500 bp.

ATAC-seq library preparation

We used a whole-brain transposition method for ATAC-sequencing library construction, as previously described²⁵, with minor modifications. In brief, brains were dissected and washed twice with 500 μl ice-cold PBS. After centrifugation at 500 × g for 5 minutes, the samples were lysed with 50 μl lysis buffer (10 mM Tris-HCl, 10 mM NaCl, 3 mM MgCl₂, 0.1% IGEPAL CA-630). We next mixed the samples vigorously by pipetting and then centrifuged them at 800 × g for 10 minutes. Supernatants were discarded and replaced with 50 μl of transposition reaction mix containing 10 mM TAPS-NaOH (pH 8.5), 5 mM MgCl₂, 10% DMF, 2.5 μl of in-house Tn5 transposase (0.8 U/μl) and NF-water. This mixture was incubated at 37 °C for 30 minutes. Afterwards, transposed DNA was purified with the MinElute Purification Kit (Qiagen) and amplified with primers containing barcodes.

Sequencing

All data were generated with the BGISEQ-500 platform (MGI)²⁶. First, the DNA concentration of each library was measured with the Qubit dsDNA HS Assay Kit 3.0. A total of 300 ng of library DNA with different sample indexes was pooled to generate circular single-stranded DNA (ssDNA circles). The ssDNA circles were then used as a template to make DNA nanoballs by rolling circle replication. DNA nanoballs were loaded onto the sequencer flowcells for 100 bp paired-end sequencing (RNA-seq) or 50 bp paired-end sequencing (ATAC-seq).

RNA-sequencing dataset processing

Quality validation of raw reads was performed using FastQC (version 0.11.6)²⁷. Low-quality reads were filtered using SOAPnuke (version 1.5.2)²⁸. Adapter sequences, primers and poly-A tails were identified and removed with cutadapt (version 1.16)²⁹. Further quality control was performed with FastQC to ensure the cleaned data were suitable for downstream analyses. Quality control results³⁰ were visualized using MultiQC (version 1.7)³¹. Statistics for the raw and clean data are given in Table 1. Cleaned reads were mapped to the reference Monomorium pharaonis genome (GCA_003260585.2)³² using HISAT2 (version 2.0.1-beta)³³. The number of reads aligning to each gene in each sample was calculated with featureCounts (version 1.5.3)³⁴ to generate a raw count matrix³⁰; aligned reads in BAM format were supplied to featureCounts together with a list of genomic features in Gene Transfer Format (GTF, ref_ASM326058v2_top_level.gff3.gz). To normalize read counts for sequencing depth and RNA composition, we used the median of ratios method in the R (version 3.5) package DESeq2 (version 1.5.3)³⁵. The plotPCA function of DESeq2 was used to assess the similarity of gene expression patterns among the different groups (Fig. 2c). Pearson correlation coefficients between samples (Fig. 2e, f) were calculated from the DESeq2-normalized data matrix.
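
The median of ratios normalization mentioned above can be illustrated with a minimal Python sketch. This is not the authors' code (the study used the R package DESeq2); it only reproduces the core idea of dividing each sample by a size factor, defined as the median ratio of its counts to a per-gene geometric mean. The function names and the toy matrix are hypothetical.

```python
import numpy as np

def median_of_ratios_size_factors(counts):
    """Return one size factor per sample for a genes x samples count matrix."""
    counts = np.asarray(counts, dtype=float)
    # Only genes with nonzero counts in every sample have a finite log geometric mean.
    expressed = np.all(counts > 0, axis=1)
    log_counts = np.log(counts[expressed])
    # Per-gene log geometric mean across samples acts as a pseudo-reference sample.
    log_geo_mean = log_counts.mean(axis=1, keepdims=True)
    # Size factor = median over genes of the ratio sample / pseudo-reference.
    return np.exp(np.median(log_counts - log_geo_mean, axis=0))

def normalize(counts):
    """Divide each sample (column) by its size factor."""
    counts = np.asarray(counts, dtype=float)
    return counts / median_of_ratios_size_factors(counts)

# Hypothetical toy data: sample 2 was sequenced twice as deeply as sample 1.
toy_counts = np.array([
    [100, 200],
    [50, 100],
    [30, 60],
    [0, 10],   # excluded from size-factor estimation because of the zero
])
size_factors = median_of_ratios_size_factors(toy_counts)
normalized = normalize(toy_counts)
print(size_factors)
print(normalized)
```

In this toy case the two size factors differ by a factor of two, so the normalized counts of the three fully expressed genes become equal across samples.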

Table 1 RNA-seq metadata and mapping statistics.

ATAC-sequencing dataset processing

Raw ATAC-sequencing data were processed (trimming, alignment, filtering and quality control) using an ATAC-sequencing pipeline³⁶. MACS2 (version 2.1.2)³⁷, running on Python 2.7, was used to identify peaks of accessible regions. We applied the IDR algorithm³⁸ to identify peaks reproducible between replicates of each caste. Overlapping peaks were subsequently merged with bedtools (version 2.26.0) intersect³⁹ to produce the final consensus peak set. The full statistical results of data processing and the number of consensus peaks for each sample are listed in Table 2. A standard peak list was generated by merging the peaks of all samples using bedtools merge³⁹. The usable reads of each sample were then mapped to the standard peak regions using the intersect function of bedtools, and the number of mapped reads per region was counted and compiled into a matrix³⁰. We normalized this raw count matrix using the median of ratios method of the R package DESeq2 (version 1.5.3). The normalized matrix was used to calculate Pearson correlation coefficients between replicates and for principal component analysis (PCA) (Fig. 3c) with DESeq2.
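
As an illustration of the standard peak list and the read counting step, the following pure-Python sketch merges overlapping intervals and counts reads per merged region. It is a simplified stand-in for bedtools merge and bedtools intersect -c, not a reimplementation of them; the function names, scaffold names and coordinates are hypothetical, and half-open coordinates are assumed.

```python
from collections import defaultdict

def merge_peaks(peaks):
    """Merge overlapping or touching (chrom, start, end) intervals per chromosome."""
    by_chrom = defaultdict(list)
    for chrom, start, end in peaks:
        by_chrom[chrom].append((start, end))
    merged = []
    for chrom in sorted(by_chrom):
        current_start, current_end = None, None
        for start, end in sorted(by_chrom[chrom]):
            if current_start is None:
                current_start, current_end = start, end
            elif start <= current_end:
                # Interval overlaps or touches the open one: extend it.
                current_end = max(current_end, end)
            else:
                merged.append((chrom, current_start, current_end))
                current_start, current_end = start, end
        merged.append((chrom, current_start, current_end))
    return merged

def count_overlaps(standard_peaks, reads):
    """Count, for each peak, the reads overlapping it by at least 1 bp."""
    counts = []
    for chrom, start, end in standard_peaks:
        n = sum(
            1
            for r_chrom, r_start, r_end in reads
            if r_chrom == chrom and r_start < end and r_end > start
        )
        counts.append(n)
    return counts

# Hypothetical peaks pooled from several samples.
peaks = [
    ("scaffold1", 100, 200),
    ("scaffold1", 150, 300),
    ("scaffold1", 400, 500),
    ("scaffold2", 10, 50),
]
# Hypothetical aligned reads from one sample.
reads = [
    ("scaffold1", 120, 170),
    ("scaffold1", 290, 340),
    ("scaffold1", 350, 399),
    ("scaffold2", 40, 90),
]
standard_peaks = merge_peaks(peaks)
peak_counts = count_overlaps(standard_peaks, reads)
print(standard_peaks)
print(peak_counts)
```

Running the counting step once per sample and stacking the resulting vectors column-wise would give a peaks-by-samples matrix of the kind described above. The linear scan over reads is only suitable for toy data.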

Table 2 ATAC-seq metadata and mapping statistics.

Identification of widely expressed or specific genes across the four groups

The raw gene expression matrix was normalized to reads per million mapped reads (RPM). For each gene, we calculated the mean RPM value and the coefficient of variation (CV) across the four groups. Genes with a mean RPM greater than 300 and a CV of less than 10% were selected as widely expressed (co-expressed) genes. We used Shannon entropy⁴⁰ to compute a specificity index for each gene. The relative expression level of a gene in group i was defined as R_i = E_i/ΣE, where E_i is the RPM value for the gene in group i, ΣE is the sum of its RPM values over all groups and N is the total number of groups. The entropy score for each gene across groups was defined as H = −Σ R_i · log2(R_i), summed over 1 ≤ i ≤ N, so that H ranges from 0 to log2(N). An entropy score close to zero indicates that expression of the gene is highly specific to one group; genes with an entropy score of less than 1.5 were selected as group-specific genes. These results are provided in Figshare³⁰.
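
The two selection rules can be sketched in Python as follows. The thresholds (mean RPM > 300, CV < 10%, H < 1.5) come from the text; everything else is an assumption made for illustration, including the use of the population standard deviation for the CV, the handling of genes with zero total expression, the function names and the toy values.

```python
import numpy as np

def coefficient_of_variation(rpm):
    """CV (std / mean) per gene for a genes x groups RPM matrix."""
    rpm = np.asarray(rpm, dtype=float)
    mean = rpm.mean(axis=1)
    std = rpm.std(axis=1)  # population standard deviation (an assumption)
    return np.divide(std, mean, out=np.full_like(mean, np.nan), where=mean > 0)

def shannon_entropy(rpm):
    """H = -sum_i R_i * log2(R_i), with R_i = E_i / sum(E), per gene."""
    rpm = np.asarray(rpm, dtype=float)
    total = rpm.sum(axis=1, keepdims=True)
    r = np.divide(rpm, total, out=np.zeros_like(rpm), where=total > 0)
    terms = np.zeros_like(r)
    positive = r > 0
    # Terms with R_i = 0 contribute nothing (0 * log2(0) is taken as 0).
    terms[positive] = r[positive] * np.log2(r[positive])
    entropy = -terms.sum(axis=1)
    # Genes with no expression at all have no defined specificity.
    entropy[total[:, 0] == 0] = np.nan
    return entropy

# Hypothetical group-level RPM values (columns: four groups).
rpm = np.array([
    [400.0, 400.0, 400.0, 400.0],   # uniform expression
    [1000.0, 0.0, 0.0, 0.0],        # expressed in a single group
    [500.0, 500.0, 0.0, 0.0],       # shared by two groups
])
entropy = shannon_entropy(rpm)
cv = coefficient_of_variation(rpm)
widely_expressed = (rpm.mean(axis=1) > 300) & (cv < 0.10)
group_specific = entropy < 1.5
print(entropy, cv, widely_expressed, group_specific)
```

With four groups the maximum entropy is log2(4) = 2, which the uniformly expressed toy gene reaches, while the gene expressed in a single group scores 0.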

Comparative analysis across groups

Comparative analysis was performed using the DESeq2 R package. The fold change between groups and the corresponding P value were calculated. Genes or peaks with fold change ≥ 1 and adjusted P value (Padj) ≤ 0.05 were selected as differentially expressed genes (DEGs) or differentially accessible regions (DARs), respectively.
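
A rough Python sketch of this selection step is given below. Two points are assumptions rather than statements from the text: the fold-change cutoff is applied here to the absolute log2 fold change, and adjusted P values are computed with the Benjamini-Hochberg procedure (the default adjustment in DESeq2, which additionally applies independent filtering that is not reproduced here). Function names and input values are hypothetical.

```python
import numpy as np

def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    p = np.asarray(p_values, dtype=float)
    n = p.size
    order = np.argsort(p)
    ranked = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest P value.
    ranked = np.minimum.accumulate(ranked[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.minimum(ranked, 1.0)
    return adjusted

def select_differential(log2_fold_change, p_adj, lfc_cutoff=1.0, padj_cutoff=0.05):
    """Boolean mask of features passing both cutoffs (assumed interpretation)."""
    lfc = np.asarray(log2_fold_change, dtype=float)
    padj = np.asarray(p_adj, dtype=float)
    return (np.abs(lfc) >= lfc_cutoff) & (padj <= padj_cutoff)

# Hypothetical test results for four genes.
p_values = [0.01, 0.04, 0.03, 0.5]
log2_fc = [2.0, -1.5, 0.3, 3.0]
p_adj = benjamini_hochberg(p_values)
selected = select_differential(log2_fc, p_adj)
print(p_adj)
print(selected)
```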

Data Records

A complete list of the 32 ant brain samples is provided in Tables 1 and 2. All raw data in this study are available in the NCBI Gene Expression Omnibus (GEO)⁴¹ and in the CNGB Sequence Archive (CNSA)⁴² (https://db.cngb.org/cnsa/). The multiQC results and matrix of gene count and DEG statistics were submitted to Figshare³⁰.

Technical Validation

RNA-sequencing metrics and reproducibility

A total of 16 RNA libraries were prepared and sequenced, with sequencing depth ranging from 104.63 to 171.60 million reads. Raw reads were filtered, leaving between 75% and 86% of reads as clean reads (Table 1). The Q20 values for the clean reads were above 95% (Table 1). Sequencing quality was validated with FastQC, and results across samples were compared with MultiQC; a representative result (all gyne samples) of the per-base Phred quality scores is shown in Fig. 2a. The GC content ranged from 40% to 45% and followed a normal distribution (Fig. 2b). Clean reads were then mapped to the Monomorium pharaonis genome. Full quality-control statistics for each sample are given in Table 1.

The reproducibility of the RNA-sequencing replicates was examined using PCA, in which samples were clearly separated by caste, with PC1 and PC2 jointly explaining 76% of the total variance in gene expression (Fig. 2c). Heatmap clustering of Pearson correlation coefficients among the 16 datasets revealed a strong correlation between replicates of the same caste (Fig. 2d). Interestingly, the three female groups (queens, gynes and workers) were closer to one another than to the male group. Pearson correlation analysis showed a correlation coefficient above 0.99 between replicates, indicating high reliability of the RNA-sequencing data (Fig. 2e). The RNA-sequencing data in our study were comparable with previously published RNA-sequencing data of gynes and workers⁷ (Fig. 2f). Taken together, these results suggest that our datasets are a reliable data resource for future studies.
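
The replicate correlation check can be sketched with NumPy as below. This is an illustrative example with made-up values, not the R code used in the study; the function name and the toy matrix (two replicates each of two hypothetical groups) are assumptions.

```python
import numpy as np

def sample_correlation(normalized):
    """Pearson correlation between samples for a features x samples matrix."""
    return np.corrcoef(np.asarray(normalized, dtype=float), rowvar=False)

# Hypothetical normalized expression: columns are A_rep1, A_rep2, B_rep1, B_rep2.
normalized = np.array([
    [10, 11, 50, 52],
    [20, 19, 5, 6],
    [30, 33, 80, 78],
    [40, 41, 1, 2],
])
corr = sample_correlation(normalized)
print(np.round(corr, 3))
```

Replicates of the same group should show higher coefficients with each other than with samples of a different group, which is the pattern a clustered heatmap of this matrix would display.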

ATAC-sequencing quality control

We assessed the quality of the ATAC-sequencing datasets using a variety of metrics (Table 2), including number of reads, mapping rate and usable reads. Samples yielded an average of 49 million usable reads after filtering, resulting in about 20,000 reproducible peaks after IDR analysis (Table 2). We calculated read enrichment around transcription start sites (TSS) and observed a strong enrichment (Table 2 and Fig. 3a), suggesting high quality of the datasets. This was also supported by the periodic pattern of fragment sizes, consistent with previous ATAC-sequencing profiles^43,44 (Fig. 3b). Reproducibility between replicates was measured by Pearson correlation coefficients, and all replicates within each caste had correlation coefficients above 0.95 (Fig. 3c). The reproducibility of the ATAC-sequencing datasets was further examined using PCA, where samples from the same caste tended to cluster together (Fig. 3d). As expected, the ATAC-sequencing samples showed a clustering pattern similar to that of the RNA-sequencing samples, with the three female groups being closer to each other. Overall, these analyses demonstrate that our ATAC-sequencing datasets can reliably detect accessible regions in the genome and can be used to further explore the relationship between epigenomic regulation and social behavior.

Comparative analysis between castes

We identified a set of genes widely expressed in the brains of all castes, as well as caste-specific brain genes³⁰. Genes co-expressed in the brains of the four groups (600 genes) outnumbered caste-specific genes (144 genes). These two gene sets are provided in Figshare³⁰ and can be used for further analysis and exploration. We counted the number of DEGs (Fig. 4a) and DARs (Fig. 4b) in gynes, workers and males compared with queens. Males showed the largest difference from queens in both gene expression and chromatin accessibility, suggesting that sex may be the most significant factor driving differential regulation of gene expression within the ant colony. In contrast, gynes and queens showed the smallest difference, with only 229 DEGs and 1,350 DARs. The numbers of DEGs (583) and DARs (2,171) between queens and workers were roughly twice those between queens and gynes, suggesting higher similarity between queens and gynes.

We next investigated the relationship between expression and chromatin accessibility for DEGs across the four castes (Fig. 4c). Interestingly, we found that locusta insulin-related peptide (LIRP) and vitellogenin-2 (vg-2) show high expression levels in queens. LIRP is a 5 kDa peptide first discovered in extracts of locust corpora cardiaca (CC)^45,46. The LIRP gene contains 3 exons separated by 2 introns, resembling the vertebrate insulin genes^47,48; its function is to regulate eusocial division of labor and caste determination, and it has been reported to show consistently higher expression in queens^9,11. Vitellogenin (Vg) encodes the major egg yolk protein precursor in insects and many other oviparous species⁴⁹. Our finding is supported by a previous study demonstrating that Vg showed higher expression in reproductive groups of eusocial insects, as it functions as a lipid carrier that provisions developing oocytes with yolk and constitutes a reliable indicator of female reproductive activity⁵⁰. Small lysine-rich protein 1 (SMKR1), ras-related and estrogen-regulated growth inhibitor (RERG), and pro-resilin were also identified as caste-specific genes expressed in gyne, worker and male brains, respectively. SMKR1 is a lysine-rich protein and may play an important role in brain development in unmated female ants⁵¹. RERG is a member of the RAS superfamily of GTPases and an estrogen-regulated growth inhibitor. The higher expression of RERG in workers is consistent with a previous study of worker-biased genes in eusocial insects⁵². Resilin is an elastomeric protein found in many insects⁵³. The high expression of pro-resilin may enable males to jump or pivot their wings efficiently.

Interestingly, the open regions near these genes showed patterns similar to those of gene expression across castes (Fig. 4c), suggesting that their transcriptional regulatory elements are crucial for the differential gene expression. Moreover, we found that two genes involved in vision, retinal homeobox protein Rx1 (Rx1) and glycine receptor subunit alpha-3 (Glra3), show lower levels of both expression and chromatin accessibility in workers, which suggests that the visual system of workers differs from that of the other three groups. Supporting this, it has been previously reported that ocelli are absent in workers of Monomorium pharaonis⁵⁴. In summary, our study provides an important resource of the epigenome and transcriptome of the ant brain, which will be of great value for studying the regulatory mechanisms behind caste differentiation in eusocial insects.

Usage Notes

The RNA-seq data processing pipeline, including data filtering, read mapping and gene expression quantification, was run on the Linux operating system (CentOS). The optimized parameters are provided in the main text. The R source code for differential gene expression (DGE) analysis, used for the downstream data analysis and visualization, is provided in Supplementary File 1.

Code availability

Data processing was performed using open source software. The tools and parameters used are listed below.

SOAPnuke: https://github.com/BGI-flexlab/SOAPnuke. Version: 1.5.2. Parameters: filter -A 0.5 -M 2 -l 10 -q 0.3 -n 0.05 -Q 2 -d.

Cutadapt: https://cutadapt.readthedocs.io/en/stable/. Version: 1.16. Parameters: -m 5 -e 0.10.

HISAT2: http://www.ccb.jhu.edu/software/hisat. Version 2.0.1-beta. Parameters: -p 4 --phred33 --sensitive --no-discordant --no-mixed -I 1 -X 1000.

featureCounts: http://subread.sourceforge.net/. Version 1.5.3. Parameters: -T 5 -p -t exon -g gene_id.

MACS2: https://github.com/taoliu/MACS. Version 2.1.2. Parameters: macs2 callpeak -t input.bam -f BAM -g 259040147 -n name.output -B -q 0.01 --nomodel.

Bedtools: https://bedtools.readthedocs.io/en/latest/content/tools/intersect.html. Version: 2.26.0. Parameters: bedtools intersect -a standardpeak.bed -b input.bam -c > output.count.

The R code used for calculating the correlation and for the comparative analysis is available in the supplementary materials.
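
As an illustration of how the listed parameters fit into complete command lines, the Python sketch below assembles HISAT2 and featureCounts invocations without running them. The option values are those listed above; the input and output options (-x, -1, -2, -S for HISAT2; -a, -o for featureCounts) are standard options of those tools but are not stated in the text, and all file names are hypothetical placeholders.

```python
import shlex

def hisat2_command(index_prefix, reads_1, reads_2, out_sam):
    """Build a HISAT2 paired-end alignment command with the listed parameters."""
    return [
        "hisat2", "-p", "4", "--phred33", "--sensitive",
        "--no-discordant", "--no-mixed", "-I", "1", "-X", "1000",
        # Input and output options below are assumptions, not from the text.
        "-x", index_prefix, "-1", reads_1, "-2", reads_2, "-S", out_sam,
    ]

def featurecounts_command(annotation, out_counts, bam):
    """Build a featureCounts command with the listed parameters."""
    return [
        "featureCounts", "-T", "5", "-p", "-t", "exon", "-g", "gene_id",
        # Annotation and output options below are assumptions, not from the text.
        "-a", annotation, "-o", out_counts, bam,
    ]

def as_shell(command):
    """Render an argument list as a single shell-quoted string."""
    return " ".join(shlex.quote(arg) for arg in command)

# Hypothetical file names for one sample.
align_cmd = hisat2_command("mpha_index", "sample_1.fq.gz", "sample_2.fq.gz", "sample.sam")
count_cmd = featurecounts_command("annotation.gtf", "sample.counts.txt", "sample.bam")
print(as_shell(align_cmd))
print(as_shell(count_cmd))
```

Keeping commands as argument lists makes them easy to pass to subprocess.run later without shell-quoting problems.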

References

Libbrecht, R. et al. Interplay between insulin signaling, juvenile hormone, and vitellogenin regulates maternal effects on polyphenism in ants. Proc Natl Acad Sci USA 110, 11050–11055 (2013).
Nowak, M., Tarnita, C. & Wilson, E. The evolution of eusociality. Nature 466, 1057–1062 (2010).
Brady, S. G., Schultz, T. R., Fisher, B. L. & Ward, P. S. Evaluating alternative hypotheses for the early evolution and diversification of ants. Proc Natl Acad Sci USA 103, 18172–18177 (2006).
Hines, H. M. Historical biogeography, divergence times, and diversification patterns of bumble bees (Hymenoptera: Apidae: Bombus). Syst. Biol 57, 58–75 (2008).
Barchuk, A. R. et al. Molecular determinants of caste differentiation in the highly eusocial honeybee Apis mellifera. BMC Dev Biol 7, 70 (2007).
Berens, A. J., Hunt, J. H. & Toth, A. L. Comparative transcriptomics of convergent evolution: different genes but conserved pathways underlie caste phenotypes across lineages of eusocial insects. Mol Biol Evol 32, 690–703 (2015).
Qiu, B. et al. Towards reconstructing the ancestral brain gene-network regulating caste differentiation in ants. Nat Ecol Evol 2, 1782–1791 (2018).
Woodard, S. H. et al. Genes involved in convergent evolution of eusociality in bees. Proc Natl Acad Sci USA 108, 7472–7477 (2011).
Toth, A. L. & Robinson, G. E. Evo-devo and the evolution of social behavior. Trends Genet 23, 334–341 (2007).
Toth, A. L. et al. Brain transcriptomic analysis in paper wasps identifies genes associated with behaviour across social insect lineages. Proc Biol Sci 277, 2139–2148 (2010).
Chandra, V. et al. Social regulation of insulin signaling and the evolution of eusociality in ants. Science 361, 398–402 (2018).
Gospocic, J. et al. The neuropeptide corazonin controls social behavior and caste identity in ants. Cell 170, 748–759. e712 (2017).
Johnson, B. R. & Tsutsui, N. D. Taxonomically restricted genes are associated with the evolution of sociality in the honey bee. BMC Genomics 12, 164 (2011).
Ferreira, P. G. et al. Transcriptome analyses of primitively eusocial wasps reveal novel insights into the evolution of sociality and the origin of alternative phenotypes. Genome Biol 14, R20 (2013).
Feldmeyer, B., Elsner, D. & Foitzik, S. Gene expression patterns associated with caste and reproductive status in ants: worker‐specific genes are more derived than queen‐specific ones. Mol Ecol 23, 151–161 (2014).
Mikheyev, A. S. & Linksvayer, T. A. Genes associated with ant social behavior show distinct transcriptional and evolutionary patterns. Elife 4, e04775 (2015).
Simola, D. F. et al. A chromatin link to caste identity in the carpenter ant Camponotus floridanus. Genome Res 23, 486–496 (2013).
Simola, D. F. et al. Epigenetic (re) programming of caste-specific behavior in the ant Camponotus floridanus. Science 351, aac6633 (2016).
Foret, S. et al. DNA methylation dynamics, metabolic fluxes, gene splicing, and alternative phenotypes in honey bees. Proc Natl Acad Sci USA 109, 4968–4973 (2012).
Berndt, K. P. & Eichler, W. Die Pharaoameise, Monomorium pharaonis (L.)(Hym., Myrmicidae). Mitt. Mus. Nat.kd. Berl., Zool. Reihe 63, 3–186 (1987).
Wetterer, J. K. Worldwide spread of the pharaoh ant, Monomorium pharaonis (Hymenoptera: Formicidae). Myrmecological News 13, 115–129 (2010).
Johnson, R. A. & Overson, R. P. Population and colony structure and morphometrics in the queen dimorphic little black ant, Monomorium sp. AZ-02, with a review of queen phenotypes in the genus Monomorium. PLoS One 12, e0180595 (2017).
Picelli, S. et al. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nat Methods 10, 1096–1098 (2013).
Head, S. R. et al. Library construction for next-generation sequencing: overviews and challenges. Biotechniques 56, 61–77 (2014).
Davie, K. et al. A single-cell transcriptome atlas of the aging Drosophila brain. Cell 174, 982–998. e920 (2018).
Huang, J. et al. A reference human genome dataset of the BGISEQ-500 sequencer. Gigascience 6, gix024 (2017).
Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data, http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2015).
Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7, gix120 (2018).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10–12 (2011).
Liu, Y. et al. An integrated chromatin accessibility and transcriptome landscape of Monomorium pharaonis brain. figshare https://doi.org/10.6084/m9.figshare.c.4745942.v4 (2020).
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Article CAS PubMed PubMed Central Google Scholar
Morandin, C. et al. Comparative transcriptomics reveals the conserved building blocks involved in parallel evolution of diverse phenotypic traits in ants. Genome Biol 17, 43 (2016).
Article PubMed PubMed Central CAS Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article CAS PubMed Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
Article PubMed PubMed Central CAS Google Scholar
Koh, P. W. et al. An atlas of transcriptional, chromatin accessibility, and surface marker changes in human mesoderm development. Sci Data 3, 160109 (2016).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol 9, R137 (2008).
Article PubMed PubMed Central CAS Google Scholar
Li, Q., Brown, James., B., Huang, H. & Bickel, P. J. Measuring reproducibility of high-throughput experiments. Ann. Appl. Stat 5, 1752–1779 (2011).
Article MathSciNet MATH Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Schug, J. et al. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 6, R33 (2005).
Article PubMed PubMed Central CAS Google Scholar
Gene Expression Omnibus, https://identifiers.org/geo:GSE143056 (2019).
CNGB. Nucleotide Sequence Archive https://db.cngb.org/search/project/CNP0000740/ (2019).
Ou, J. et al. ATACseqQC: a Bioconductor package for post-alignment quality assessment of ATAC-seq data. BMC Genomics 19, 169 (2018).
Article PubMed PubMed Central CAS Google Scholar
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10, 1213 (2013).
Article CAS PubMed PubMed Central Google Scholar
Claeys, I. et al. Insulin-related peptides and their conserved signal transduction pathway. Peptides 23, 807–816 (2002).
Article CAS PubMed Google Scholar
Hetru, C., Li, K. W., Bulet, P., Lagueux, M. & Hoffmann, J. A. Isolation and structural characterization of an insulin‐related molecule, a predominant neuropeptide from Locusta migratoria. Eur J Biochem 201, 495–499 (1991).
Article CAS PubMed Google Scholar
Wu, Q. & Brown, M. R. Signaling and function of insulin-like peptides in insects. Annu Rev Entomol 51, 1–24 (2006).
Article CAS PubMed Google Scholar
Lagueux, M., Lwoff, L., Meister, M., Goltzené, F. & Hoffmann, J. A. cDNAs from neurosecretory cells of brains of Locusta migratoria (Insecta, Orthoptera) encoding a novel member of the superfamily of insulins. Eur J Biochem 187, 249–254 (1990).
Article CAS PubMed Google Scholar
Tufail, M., Nagaba, Y., Elgendy, A. M. & Takeda, M. Regulation of vitellogenin genes in insects. Entomological Science 17, 269–282 (2014).
Article Google Scholar
Corona, M. et al. Vitellogenin underwent subfunctionalization to acquire caste and behavioral specific expression in the harvester ant Pogonomyrmex barbatus. PLoS Genet 9 (2013).
Ukmar-Godec, T. et al. Lysine/RNA-interactions drive and regulate biomolecular condensation. Nat Commun 10, 1–15 (2019).
Article CAS Google Scholar
Warner, M. R., Qiu, L., Holmes, M. J., Mikheyev, A. S. & Linksvayer, T. A. Convergent eusocial evolution is based on a shared reproductive groundplan plus lineage-specific plastic genes. Nat Commun 10, 1–11 (2019).
Article CAS Google Scholar
Qin, G., Hu, X., Cebe, P. & Kaplan, D. L. Mechanism of resilin elasticity. Nat Commun 3, 1–9 (2012).
Article CAS Google Scholar
Narendra, A., Ramirez-Esquivel, F. & Ribi, W. A. Compound eye and ocellar structure for walking and flying modes of locomotion in the Australian ant, Camponotus consobrinus. Sci Rep 6, 22331 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank all members of the Center of Digital Cells and the Center of Biodiversity Genomics at BGI-Shenzhen for helpful comments, and all members of the State Key Laboratory of Genetic Resource and Evolution at the Kunming Institution of Zoology for assistance with sample collection. We thank Miguel A. Esteban and Carl Ward of the Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, for revising the manuscript. We are grateful for the support provided by the China National GeneBank. This work was supported by the National Natural Science Foundation of China (No. 31900466), the Natural Science Foundation of Guangdong Province, China (No. 2018A030313379), the Shenzhen Municipal Government of China (No. 20170731162715261) and the Shenzhen Bay Laboratory (No. SZBL2019062801012).

Author information

These authors contributed equally: Mingyue Wang, Yang Liu.

Authors and Affiliations

BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
Mingyue Wang, Yang Liu, Liang Wu, Yue Yuan, Xiaoyu Wei, Jiangshan Xu, Mengnan Cheng & Huanming Yang
BGI-Shenzhen, Shenzhen, 518083, China
Mingyue Wang, Yang Liu, Tinggang Wen, Zijun Xiong, Zhifeng Wang, Wei Jiang, Yeya Yu, Liang Wu, Yue Yuan, Xiaoyu Wei, Jiangshan Xu, Mengnan Cheng, Pei Zhang, Panyi Li, Yong Hou, Huanming Yang, Qiye Li, Chuanyu Liu & Longqi Liu
China National Gene Bank, BGI-Shenzhen, Shenzhen, 518120, China
Mingyue Wang, Yang Liu, Tinggang Wen, Zijun Xiong, Zhifeng Wang, Wei Jiang, Yeya Yu, Liang Wu, Yue Yuan, Xiaoyu Wei, Jiangshan Xu, Mengnan Cheng, Pei Zhang, Panyi Li, Yong Hou, Guojie Zhang, Qiye Li, Chuanyu Liu & Longqi Liu
State Key Laboratory of Genetic Resource and Evolution, Kunming Institution of Zoology, Chinese Academy of Science, Kunming, 650223, China
Weiwei Liu, Qionghua Gao, Jie Zhao & Guojie Zhang
Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Science, Kunming, 650223, China
Weiwei Liu, Qionghua Gao, Jie Zhao & Guojie Zhang
BGI College, Zhengzhou University, Zhengzhou, 450000, China
Yeya Yu
James D. Watson Institute of Genome Sciences, Hangzhou, 310013, China
Huanming Yang
Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, DK-2100, Denmark
Guojie Zhang
Shenzhen Bay Laboratory, Shenzhen, 518083, China
Longqi Liu

Contributions

M.W., L.L., Y.L. and C.L. conceived the idea. T.W., W.L., J.Z. and M.W. collected samples. T.W. dissected brains. M.W. and T.W. generated the data. Z.W., W.J., Y.Y., Y.Yuan, J.S., M.C. and P.L. assisted with the experiments. Y.L. analyzed the data with the assistance of Z.X. and P.Z. M.W. wrote the manuscript with the input of Y.L. and M.W. L.L. supervised the study and revised the manuscript. Q.L., G.Z., H.Y. and Y.H. provided helpful comments on this study. All authors reviewed and approved the final manuscript.

Corresponding authors

Correspondence to Chuanyu Liu or Longqi Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary File 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

About this article

Cite this article

Wang, M., Liu, Y., Wen, T. et al. Chromatin accessibility and transcriptome landscapes of Monomorium pharaonis brain. Sci Data 7, 217 (2020). https://doi.org/10.1038/s41597-020-0556-x

Received: 06 January 2020
Accepted: 08 June 2020
Published: 08 July 2020
DOI: https://doi.org/10.1038/s41597-020-0556-x
