CellNeighborEX: deciphering neighbor‐dependent gene expression from spatial transcriptomics data

Abstract Cells have evolved their communication methods to sense their microenvironments and send biological signals. In addition to communication using ligands and receptors, cells use diverse channels including gap junctions to communicate with their immediate neighbors. Current approaches, however, cannot effectively capture the influence of various microenvironments. Here, we propose a novel approach to investigate cell neighbor‐dependent gene expression (CellNeighborEX) in spatial transcriptomics (ST) data. To categorize cells based on their microenvironment, CellNeighborEX uses direct cell location or the mixture of transcriptome from multiple cells depending on ST technologies. For each cell type, CellNeighborEX identifies diverse gene sets associated with partnering cell types, providing further insight. We found that cells express different genes depending on their neighboring cell types in various tissues including mouse embryos, brain, and liver cancer. Those genes are associated with critical biological processes such as development or metastases. We further validated that gene expression is induced by neighboring partners via spatial visualization. The neighbor‐dependent gene expression suggests new potential genes involved in cell–cell interactions beyond what ligand‐receptor co‐expression can discover.


Introduction
Cells communicate with their microenvironment in various ways including the release of soluble molecules and direct cell contact (Yang et al, 2021), actively changing their transcriptomes in response to external signals (Cable et al, 2021;Fischer et al, 2022).To gain insight into critical biological processes such as diseases and development, it is essential to understand the various ways of cell-cell communication.Experimental approaches to cell-cell communication usually require an elaborate and intricate setup (Nishida-Aoki & Gujral, 2019).
Genome-scale study on cell-cell interactions has been recently possible by using ligand-receptor co-expression on single cell RNAsequencing (scRNA-seq); Browaeys et al, 2020;Efremova et al, 2020) and spatial transcriptomics (ST) data (preprint: Pham et al, 2020;Garcia-Alonso et al, 2021;Li et al, 2021;Shao et al, 2022).The use of ligand-receptor co-expression enabled inferring interacting cell type pairs and identifying intercellular signaling pathways without relying on a complicated experimental setup.However, it cannot elucidate the gene expression of individual cells changed by direct cell contact.
A growing body of studies on cell communication has demonstrated that cells are influenced by their microenvironment and neighboring cells (Barone et al, 2017;Hannezo & Heisenberg, 2019).Grafting experiments in developing embryos manifested that direct cell contact can induce signals for the development of a specific tissue type (Spemann & Mangold, 2003;Solini et al, 2017).More recently, RNA sequencing of physically interacting cells (PIC-seq) has revealed that cells express different genes depending on the types of neighboring cells during mouse development (Kim et al, 2023).This study suggests that cells have distinct expression profiles through direct cell contact independently from ligand-receptor-mediated communication.
Recent development in ST has opened potential ways to explore the role of the microenvironment.The spatial gene expression profile has made it possible to study the transcriptional activity of a cell together with that of the neighborhood within intact tissues.There are largely two types of ST data with their own advantages and limitations.Image-based approaches including MERFISH (Chen et al, 2015) and seqFISH (Lubeck et al, 2014;Eng et al, 2019) use fluorescence in situ hybridization (FISH) to visualize RNA species of interest.While image-based ST approaches can quantify RNAs at cellular resolution, the number of detectable RNA species is still limited.The next-generation-sequencing (NGS)-based ST approaches such as Visium (St ahl et al, 2016) and Slide-seq (Rodriques et al, 2019;Stickels et al, 2021a;Zhao et al, 2022a) leverage spatially barcoded beads.While NGS-based approaches can unbiasedly profile the transcriptome, a barcode can be linked to the mixture of transcriptome of multiple cells or cell portions depending on the position and the resolution of the barcoded spots, making it hard to detect gene expression changed by the cellular microenvironment.
Many computational tools have been developed to understand cell-cell interactions from ST data.CellphoneDB v.3.0 (Garcia-Alonso et al, 2021), MESSI (Li et al, 2021), SpaTalk (Shao et al, 2022), and stLearn (preprint: Pham et al, 2020) use the coexpression of ligand-receptor pairs to study cellular communication.However, ligand-receptor co-expression cannot completely capture cell-cell interactions due to direct contact.SVCA (Arnol et al, 2019) decomposes the sources of gene expression variation into intrinsic effects, environmental effects, and cell-cell interactions.It explains the relationship between gene expression and cell-cell interactions.However, SVCA does not have a function to detect gene expression change associated with cell contact, and their strategy has been only optimized for image-based ST data.As MISTy (Tanevski et al, 2022) quantifies the contributions of different spatial contexts to the expression of markers of interest, the influence of immediate neighborhoods on the expression of markers can be investigated.However, MISTy requires to pre-select the list of marker genes to find potential interactions and it has not been designed to identify gene expression change related to cell contact in an unbiased way.DeepLinc (Li & Yang, 2022) reconstructs a cell interaction network from ST data.Regarding three nearest neighbors as direct contact, DeepLinc finds signature genes contributing to interactions between cell types and infers proximal interactions between them.However, it does not uncover specific relationships between the signature genes and interacting cell types.C-SIDE (Cable et al, 2022) examines up-and down-regulated genes depending on proximity to a certain cell type.Because the interaction between cell types is defined based on cell density rather than cell contact, C-SIDE does not work for studying cell contact-dependent gene expression.NCEM (Fischer et al, 2022) investigates transcriptomic change depending on local environments but it has not been designed to study the influence of cell contact on gene expression, particularly for NGS-based data.NCEM considers one barcoded-spot a single cell type even in low resolution Visium data, so it does not look into the influence of direct contact between multiple cell types within one spot.Although spatial context has been applied to study cell-cell interactions, transcriptomic change associated with cell contact has not been fully explored yet.It is still challenging to detect genes influenced by cell contact regardless of the data types.
Here, we propose a universal approach called CellNeighborEX to identify genes influenced by neighboring cells in ST data.CellNeigh-borEX dissects the transcriptome of cells with their immediate neighbors to categorize cells based on the neighboring cell types.For NGS-based ST data where exact cell locations are not available, CellNeighborEX actively uses the mixture of transcriptome to identify immediate neighbors.CellNeighborEX has been applied to various ST data from mouse embryos, hippocampus, and liver cancer to identify neighbor-dependent genes.These transcriptomic changes have been confirmed in the spatial context.We showed that cells express specific genes depending on their neighboring cell types.The neighbor-dependent gene expression suggests new potential genes involved in cell-cell interactions beyond what ligand-receptor co-expression can discover and gives clues on complex biological processes.

CellNeighborEX categorizes transcriptome to investigate the influence of neighboring cell types
CellNeighborEX defines immediate neighbors differently for imageand NGS-based ST data.In image-based ST data where exact cell locations are available, CellNeighborEX finds the nearest neighbors using Delaunay triangulation (Delaunay, 1934), radial distance, or k-nearest neighbors (KNN; Fix & Hodges, 1951;Cover & Hart, 1967;Fig 1A).Both radial distance and KNN require prior knowledge.KNN is useful for cells evenly distributed on a tissue.Radial distance is useful when we know the distance between the cells of interest.Delaunay triangulation can be used alone or in combination with radial distance.
CellNeighborEX classifies all cells into two groups based on the cell types of the nearest neighbors.Homotypic neighbors consist of the same cell type.Heterotypic neighbors are composed of different cell types.The influence of neighboring cells can be measured by comparing the transcriptome of heterotypic neighbors against that of homotypic neighbors (log ratio > 0.4, P-value < 0.01, FDR < 0.05).Parametric (i.e., Student's t-test, Welch's t-test) or non-parametric (i.e., Mann-Whitney U test) statistical tests are performed as differential expression (DE) analysis depending on the sample size of the groups (see Materials and Methods).This strategy also works for the NGS-based ST approach with high resolution (< 1 μm resolution) after cell segmentation.
The locations of cells are not explicitly given for NGS-based ST approaches and a barcoded spot contains the mixed transcriptome from multiple cells.CellNeighborEX capitalizes on spots with multiple cell types (or heterotypic spots) as the evidence for cell contact (Fig 1B).This strategy is effective in studying neighbor-dependent gene expression when the diameter of a spot is near cellular resolution.For instance, Slide-seq, a ST approach with 10 μm resolution, has homotypic spots (the same cell type in a bead) and heterotypic spots (two or more cell types in a bead), and the majority of the heterotypic spots are composed of two cell types (97% of the spots are composed of one or two cell types) (Rodriques et al, 2019).Cell-NeighborEX uses RCTD (Cable et al, 2021) to decompose the cell  .The artificial heterotypic spots are created by merging two random homotypic spots using information on the heterotypic spots' cell type proportions estimated by RCTD.The artificial heterotypic spots represent two different cell types just combined without cell-cell interactions.We confirmed that our neighbor-dependent genes were also differentially expressed against these artificial spots (FDR < 0.01).
For the 354 up-regulated genes, we carried out Gene Ontology (GO) analysis using the enrichGO function from the clusterProfiler R package.We selected top 20 biological process GO terms based on the adjusted P-values.Here, the P-value represents the probability of observing the association between the set of genes and the GO terms by chance.We found that the top 20 GO terms were associated with biological terms such as "pattern specification process" and "embryonic organ morphogenesis" (Fig EV1C ), suggesting that cell contact with other cells-or tissue types may regulate the coordinated developmental processes.
CellNeighborEX detects neighbor-dependent genes in slide-seq data from a mouse embryo We analyzed Slide-seq V2 data from a mouse embryo.For this NGSbased data, CellNeighborEX uses heterotypic beads to define immediate neighbors.To find heterotypic spots, we applied the RCTD (Cable et al, 2021) deconvolution tool using single cell RNAsequencing (scRNA-seq) data from a mouse embryo (Cao et al, 2019a;Data ref: Cao et al, 2019b) as a reference.As a result, RCTD identified 8,094 homotypic and 34,268 heterotypic spots in the mouse embryo.The decomposed cell types of the heterotypic spots were additionally validated by the expression of cell type markers and correlation analysis (see Materials and Methods).We confirmed whether the respective cell type markers are expressed in the annotated heterotypic spots.We also examined correlation between true heterotypic spots and artificial ones.We generated many combinatorial types of artificial heterotypic spots (Appendix Fig S2).We checked if the cell types of true heterotypic spots are the same as those of artificial heterotypic spots with the largest correlation.For the validated heterotypic spots, comparing the gene expression levels of heterotypic spots with the expression of homotypic spots, CellNeighborEX detected neighbor-dependent genes.Finally, CellNeighborEX found 28 up-regulated genes from 9 heterotypic pairs, and 28 down-regulated genes from 11 heterotypic pairs in the embryo (Dataset EV1C and D).
For example, CellNeighborEX identified 17 genes including Cd24a, which are highly expressed in the heterotypic spots of Endothelial and Lens cells compared with their respective homotypic spots (Fig EV2A).The heterotypic spots expressed marker genes in both Endothelial (Pecam1 and Egfl7) and Lens cells (Cryba1 and Cryaa).We confirmed in the spatial visualization that Cd24a is more highly expressed in the heterotypic spots of Endothelial+Lens cells (red boundaries) compared with the homotypic spots of Endothelial (blue) and Lens (black) cells (Fig EV2B).
To find from which cell type the expression of Cd24a comes between Endothelial cells and Lens cells, we used a regression model in which gene expression is shown along the proportion of one cell type in the heterotypic spots (see Materials and Methods).If gene expression increases as the proportion of one cell type grows, we regarded the cell type as the source of the expression.Otherwise, the other cell type was considered the origin of the expression.We found that the expression level of Cd24a increased as the proportion of Endothelial cells (against Lens cells) increased (Appendix Fig S3A).We further confirmed that Cd24a is expressed mainly from Endothelial cells when examining scRNA- A In image-based ST data, immediate neighbors for cell contact are determined by algorithms such as Delaunay triangulation, radial distance, and KNN.Based on their cell types, homotypic and heterotypic neighbors are defined.CellNeighborEX detects genes influenced by neighbors by comparing the transcriptome of heterotypic neighbors with that of homotypic neighbors.B In NGS-based ST data, there are homotypic spots (the same cell type in a bead) and heterotypic spots (multiple cell types in a bead).The heterotypic spots are regarded as evidence for cell contact.CellNeighborEX compares the heterotypic spots with the homotypic ones to detect neighbor-dependent genes.Additional statistical tests with null models are applied for validation.
seq data from a mouse embryo (Cao et al, 2019a;Data ref: Cao et al, 2019b;Fig EV2C).The Cd24a (CD24) protein is known as a ligand for P-selectin and a cell adhesion molecule (CAM) involved in cellular binding in Endothelial cells (Sammar et al, 1994).
For the 28 up-regulated genes, the GO analysis showed terms related to tissue development including "eye development" and "digestive system development" (Fig EV2D), further suggesting the role of cell contact in the developmental processes.
CellNeighborEX detects genes influenced by the tumor microenvironment (TME) in slide-seq data from mouse liver cancer For Slide-seq data in mouse liver metastases, RCTD used singlenucleus RNA-sequencing (snRNA-seq) data in mouse liver cancer (Zhao et al, 2022a;Data ref: Zhao et al, 2022b) as a reference.It identified 16,557 homotypic and 6,284 heterotypic spots.After validating the cell type annotation of heterotypic spots (see Materials and Methods-Appendix Fig S2 ), CellNeighborEX detected 42 upregulated genes from 10 heterotypic pairs and 3 down-regulated genes from 2 heterotypic pairs (Dataset EV1E and F).
For instance, CellNeighborEX found that F13a1 is highly expressed when Monocyte cells contact Tumor cells (Fig EV3A).Its spatial mapping displays the higher expression in the heterotypic spots than the homotypic ones (Fig EV3B).Using the regression model and snRNA-seq data, we checked that the expression of F13a1 is derived from Monocyte cells (Appendix Fig S3B and Fig EV3C).Previous global proteomic analysis on small extracellular vesicles identified that F13a1 is associated with liver cancers (Dong et al, 2022).Besides, F13a1 promotes lung squamous cancer (Porrello et al, 2018) and is a biomarker for colorectal cancers (Peltier et al, 2018).
For the 42 up-regulated genes, the GO analysis showed terms such as "collagen fibril organization" and "endothelial cell migration" (Fig EV3D).Specifically, Col1a2, Col3a1, Ext1, and Tgfbr1 are associated with collagen fibril organization.Col1a2 and Col3a1 are more expressed when Vascular smooth muscle cells contact Tumor III cells (VSMC+Tumor III).Ext1 and Tgfbr1 are more expressed when Tumor III cells contact Hepatocyte II cells (Tumor III + Hepatocyte II).These four genes are all highly expressed when non-tumor and tumor cells interact with each other.Based on previous studies showing that collagen influences cancer cell behaviors such as metastasis, tumorigenesis, and proliferation (Xu et al, 2019), we can infer that the four genes might affect liver cancer by controlling the collagen arrangement among VSMC, Hepatocyte, and Tumor cells.A vitro study actually showed that the down-regulation of Col1a2 suppressed hepatocellular cancer (Ji et al, 2010), which supports that neighbor-dependent genes are indeed associated with critical biological processes.It is notable that CellNeighborEX can detect genes influenced by TME in an unbiased way.

CellNeighborEX discovers neighbor-dependent genes in slide-seq data from mouse brain
We also analyzed Slide-seq V2 data from mouse brain.Using scRNA-seq data for mouse hippocampus (Saunders et al, 2018a;Data ref: Saunders et al, 2018b) as a reference, we ran RCTD (Cable et al, 2021).In total, 12,013 homotypic and 29,331 heterotypic spots were identified in the mouse hippocampus.We additionally validated the decomposed cell types of the heterotypic spots through the expression of cell type markers and correlation analysis (see .CellNeighborEX found 155 up-regulated genes from 21 heterotypic pairs and 55 downregulated genes from 8 heterotypic pairs for this dataset (Dataset EV1G and H).
For example, CellNeighborEX detected 3 up-regulated genes in the heterotypic spots of Endothelial tip (EnT) and Astrocyte cells including Fabp7 (Fig 2A).Its spatial visualization shows that Fabp7 is more expressed in the heterotypic spots than the respective homotypic ones (Fig 2B).Using the scRNA-seq data from hippocampus (Saunders et al, 2018a;Data ref: Saunders et al, 2018b), we examined the potential cell types expressing these genes (Fig 2C).For the 155 up-regulated genes, the GO analysis showed terms such as "regulation of metal ion transport" and "dopamine secretion", suggesting that cell contact may contribute to neuronal processes in the mouse brain (Fig 2D).
To validate the accuracy of our findings, we designed a set of experiments to observe the expression of the neighbor-dependent genes, Trf, Fabp7, and Dbi, which were shown to be upregulated in our computational predictions when Astrocyte are adjacent to EnT cells.We isolated Astrocyte and EnT cells from mouse hippocampus as demonstrated graphically by Fig 3A.This strategy leverages specific cell membrane markers such as CD34-positive, VEGFR2positive, and VEGFR1-negative for EnT cells and ACSA2-positive and CD11b-negative for Astrocyte (Fig 3B).We used two different approaches to analyze the expression of Trf, Fabp7, and Dbi via qPCR-based mRNA expression analysis of Astrocyte and EnT cells isolated from the mouse hippocampus.In our first approach, we labeled Astrocyte with GFP using comet-pD2109-CMV lentiviral particles.Next, we used monocultures of EnT cells, Astrocyte GFPpositive, and 48 h co-cultures of these two cell types in a 1:1 ratio.After sorting Astrocyte and EnT cells based on their GFP expression (Fig 3C ), we used monocultured and co-cultured Astrocyte and EnT cells to determine the expression of Trf, Fabp7, and Dbi.Through this experimental approach, we observed that the expression of Trf To show that the expression of our tested genes was due to not the release of cellular factors but direct cell contact, we performed a media exchange assay in which we cultured EnT cells with Astrocyte-derived media, and cultured Astrocyte with EnT .Then, we analyzed the expression of Trf, Fabp7, and Dbi.In these conditions, we did not observe significant changes in their expression of these genes compared to their expression in the mouse hippocampus when Astrocyte and EnT cells were cultured in their respective media (Fig EV4B To demonstrate the usefulness of using the heterotypic spots in Slide-seq, we additionally ran NicheNet on the heterotypic spots.We set the heterotypic spots as a receiver as well as a sender (autocrine mode in NicheNet).We found that 2 genes (of 28 up-regulated genes from 9 heterotypic pairs) in the embryo, 15 genes (of 42 up-regulated genes from 10 heterotypic pairs) in the liver cancer, and additional 11 genes (of 155 up-regulated genes from 21 heterotypic pairs) in the hippocampus common to our neighbor-dependent genes (Dataset EV3), suggesting that the use of heterotypic beads are useful in identifying genes related to cell communication.

Neighbor-dependent genes demonstrate niche-specific expression
We further examined if neighbor-dependent genes are able to show that cells express specific sets of genes depending on their niches.By running CellNeighborEX on the mouse embryo seqFISH data (Fig 4A ), we found that Gut tube cells highly express Tbx1 when adjacent to Cranial mesoderm, Pitx1 when adjacent to Neural crest, and Foxf1 when adjacent to Splanchnic mesoderm in the mouse embryo (Fig 4B).To investigate niche-specific gene expression, we colored the boundary of Gut tube cells based on its neighboring cell types: red when proximal to Cranial mesoderm, green when to Neural crest, blue when to Splanchnic mesoderm, and orange when to another Gut tube.To easily distinguish gene expression change depending on neighboring cell types, we defined the neighboring cell type-specific genes using RGB color channels (see Materials and Methods): Tbx1 (red), Pitx1 (green), and Foxf1 (blue).Then, we represented the expression of these three genes using the combination of each color channel.Among the three genes, Tbx1 (red inside the boundary) is dominantly expressed when Gut tube cells are next to Cranial mesoderm (Gut-tube/Cranial mesoderm, red boundary), Pitx1 (green inside the boundary) is dominantly expressed when Gut tube cells are next to Neural crest (Gut-tube/Neural crest, green boundary), and Foxf1 (blue inside the boundary) is dominantly expressed when Gut tube cells are next to Splanchnic mesoderm (Gut-tube/Splanchnic-mesoderm; Fig 4B).These results indicate that Gut tube cells vary the expression levels of these genes depending on their neighboring cell types.
◀ Figure 3. Validation of neighbor-dependent gene expression predictions upon co-culture of cells derived from mouse hippocampus.
A Model depicting the experimental methodology applied to analyze the expression of the predicted neighbor-dependent genes in the mouse hippocampus.Briefly, we isolated Astrocyte and EnT cells from mouse hippocampus through cell sorting with specific cell membrane markers for these cell types.Then, we labeled Astrocyte with GFP and performed monocultures of Astrocyte and EnT as well as co-culture of both cell types for 48 h, followed by sorting of Astrocyte and EnT cells based on their GFP expression.The sorted cells and the monocultures were used to perform RT-qPCR to determine the expression of the predicted neighbor-dependent genes Trf, Fabp7, and Dbi.

Niche specific-gene expression accounts for cellular heterogeneity
Cellular heterogeneity is caused by a number of reasons in different contexts.scRNA-seq is useful in describing cell heterogeneity but cannot explain the cause of heterogeneity.We tested if the niche specific genes obtained from ST study can account for cellular heterogeneity in scRNA-seq.We selected a cell type showing neighbordependent gene expression.Figure EV5A shows an example of neighboring cell type-dependent gene expression in the hippocampus Slide-seq V2 data.Endothelial tip cells dominantly express Igfbp7 (red) when proximal to Choroid (EnT + Ch), Trf (green) when proximal to Astrocyte (EnT + A), and Plp1 (blue) when proximal to Interneuron (EnT + In).In the hippocampus scRNA-seq data, it was confirmed that the three genes were mostly expressed from EnT (Fig EV5B).
To investigate the heterogeneity of EnT cells, we separately selected EnT cells in the scRNA-seq data.For the EnT cells, we performed clustering analysis using the expression values of the three genes and identified 4 clusters (Fig EV5C).The UMAP plot shows the expression of Igfbp7, Trf, and Plp1 in the corresponding clusters.We further labeled them based on the neighboring cell type information (Fig EV5D).We also found an additional example for the heterogeneity of Interneuron cells in the hippocampus Slide-seq V2 data (Appendix Fig S7).These findings suggest a possibility that cellular heterogeneity might be stemmed from neighboring cell typedependent gene expression.

Discussion
Cell communication is a fundamental process related to various functions such as cell growth, development, and diseases (Yang et al, 2021).Cell communication coordinates the functions of multicellular organisms (Radhakrishnan et al, 2010).Even with its importance, a systematic study of cell communication was not easy without a well-curated experimental setup until scRNA-seq is available.To study cell-cell interactions in scRNA-seq data, coexpression of ligand-receptor pairs has been used (Browaeys et al, 2020;Efremova et al, 2020).However, it was not possible to study other types of cell communication such as direct contact due to the loss of spatial information in scRNA-seq.
RNA sequencing of physically interacting multi-cells or PIC-seq has provided the transcriptomic landscapes of cells contacting each other (Boisset et al, 2018;Giladi et al, 2020;Kim et al, 2023).In our previous study using PIC-seq from mouse embryos, we found that direct cell contact can induce the expression of specific gene sets depending on the neighboring cell types (Kim et al, 2023).This unbiased approach systematically studied cell contact-dependent expression during mouse development.However, PIC-seq can be biased to cell interactions more strongly bound to each other.
ST technologies have enabled spatial mapping of gene expression and have provided opportunities to look into cellular microenvironments.Accordingly, a number of computational methods that investigate cell-cell interactions in the spatial domain have been developed (Arnol et al, 2019;preprint: Pham et  A The spatial mapping with RGB channels displays the simultaneous expression of neighbor-dependent genes in the mouse embryo.Primitive erythroid lineage (PEL) cells dominantly express Hba-a1 (red) when contacting Definitive erythroid lineage (DEL), and Cdkn1c (green) when contacting Limb mesenchyme (LM).B In the mouse embryo scRNA-seq data, Hba-a1 and Cdkn1c are expressed from PEL. C In the Spatial mapping for the mouse hippocampus, Entorhinal (Ento) cells express Nnat (red) when contacting Choroid (Ch), Gda (green) when contacting Neuron.Slc17a6 (N), and Atp2b1 (blue) when contacting Interneuron (In).D In the mouse hippocampus scRNA-seq data, Nnat, Gda, and Atp2b1 are mostly expressed from Ento.E In the spatial mapping for the mouse liver cancer, Monocyte cells dominantly express Marco (red) when contacting Hepatocyte I, Vti1a (green) when contacting Hepatocyte II, and F13a1 (blue) when contacting Tumor III.F In the mouse liver cancer snRNA-seq data, Marco, Vti1a, and F13a1 are expressed from Monocyte cells.For NGS-based ST data where exact cell locations are not available, we used the heterotypic beads.
To leverage CellNeighborEX effectively, there are three considerations to keep in mind.Firstly, CellNeighborEX works for ST data with single cell or near-cellular resolution.Using heterotypic beads, it successfully identified neighboring cell type-dependent genes in Slide-seq with 10 μm resolution.However, it is not suitable to detect neighbor-dependent genes in ST data with low resolution such as Visium (St ahl et al, 2016) as there could be more than 2 cell types.For the higher resolution ST data such as Seq-Scope (Cho et al, 2021), it is possible to use heterotypic beads by rescaling them to 10 μm resolution.Second, careful cell type annotation is needed.The identification of cell contact-dependent genes is heavily affected by cell type annotation as CellNeighborEX categorizes cells according to defined cell types.To minimize yielding false positive, it is recommended to validate annotated cell types by confirming the expression of cell type markers and investigating correlations between true heterotypic spots and artificial heterotypic ones.Third, the number of neighbor-dependent genes can vary across the datasets.Image-based approaches such as seqFISH provide more accurate measurement of RNA quantity against NGS-based ones, which can allow more statistical power in detecting neighbor dependentgenes.However, the number of detectable genes is limited in the image-based approaches, making it hard to detect unknown genes influenced by neighbors.
CellNeighborEX identified neighbor-dependent genes in various ST datasets including mouse embryos (Figs EV1 and EV2), liver cancer (Fig EV3 ), and mouse hippocampus (Fig 2).Our results indicate that neighbor-dependent genes are found in most of the cell types and tissues, further expanding our observation in the developing mouse embryos (Kim et al, 2023).
Interestingly, we found that genes influenced by neighbors were associated with important cell functions.For instance, CellNeigh-borEX found that neighbor-dependent genes are associated with embryonic development in both seqFISH (Fig EV1D ) and Slide-seq V2 (Fig EV2D) data.Our results may suggest that cell contact triggers genes important for further development.CellNeighborEX also provided information about cell types and genes influenced by TME.For instance, CellNeighborEX found that F13a1 is highly expressed when Monocyte cells contact Tumor cells.F13a1 has been known to be associated with various cancers including liver cancer (Peltier et al, 2018;Porrello et al, 2018;Dong et al, 2022).In the GO analysis, we found that the neighbor-dependent genes are further associated with cancer metastases (Fig EV3D).Our results show that CellNeighborEX is a useful tool to study the influence of TME in an unbiased way.Besides, the experiment using a co-culture system clearly demonstrated the higher expression of neighbordependent genes (Fig 3D).It is also of note that we saw a little overlap with the results obtained from ligand-receptor pairs (Appendix Fig S4).These findings indicate that studying direct cell contact is important to understand cell-cell interactions more thoroughly.
We used heterotypic beads as evidence for cell contact in NGSbased ST data.The neighbor-dependent genes that we identified in Slide-seq data were more highly expressed than artificially generated null models (Appendix Fig S1).From the regression model using the heterotypic beads, we predicted cell types expressing the neighbor-dependent genes (Appendix Fig S3).Our strategy suggests new ways to utilize heterotypic beads in the high-resolution ST data such as Slide-seq.
We observed that cells express specific sets of genes depending on their neighbors (Figs 4 and 5).This niche-specific gene expression partly explains the cause of cellular heterogeneity shown in scRNA-seq data (Fig EV5 and Appendix Fig S7).Also, this suggests that we can annotate sub-clusters of cells based on their neighboring cell types, and further predict neighboring cell types of cells using niche-specific genes.
ST information has been used to understand cell communication.NCEM (Fischer et al, 2022) is a graph neural network model to investigate the influence of neighboring cells on gene expression.NCEM uses neighbors in an intermediate range while we focus on immediate neighbors, thereby having lower complexity of neighboring cell types.That allows studying the influence of direct contact between two different cell types.Based on this, the gene expression changes can be explicitly validated in the spatial domain, suggesting niche-specific expression.
To sum up, CellNeighborEX is a new approach to explore transcriptomic changes caused by direct cell contact in ST data.Studying cell contact-dependent gene expression provides opportunities to understand cell-cell interactions between two adjacent cells from a new perspective.It enabled the identification of new genes potentially involved in intercellular communication beyond previous approaches that use ligand-receptor pairs.It also demonstrated gene expression varies depending on neighboring cell types, explaining cellular heterogeneity.Oligonucleotides and sequence-based reagents

Data preprocessing
For seqFISH data from a mouse embryo (Lohoff et al, 2022), we used the gene expression data and annotated cell types preprocessed by Squidpy (Palla et al, 2022a;Data ref: Palla et al, 2022b).For Slide-seq data from a mouse embryo (Stickels et al, 2021a;Data ref: Stickels et al, 2021b), hippocampus (Stickels et al, 2021a;Data ref: Stickels et al, 2021b), and liver cancer (Zhao et al, 2022a;Data ref: Zhao et al, 2022b), we obtained them from Puck_190926_03, Puck_200115_08, and mouse_liver_met_2_rna_201002_04, respectively.For the embryo and liver cancer, samples that have unique feature counts < 200 were filtered out.For the hippocampus, all samples were used without the filtering because the samples with unique feature counts < 200 take up a considerable percentage of the total samples (i.e., about 40%).The values of the count matrix per dataset were log-normalized and then top 2000 variable genes as well as cell type markers (Dataset EV4) were selected.
For liver cancer snRNA-seq data, we pre-processed paired dataset (Zhao et al, 2022a;Data ref: Zhao et al, 2022b) given with the Slideseq data from the mouse liver cancer.The genes expressed in < 3 nuclei were filtered out.The samples that have unique feature counts < 200 and mitochondrial RNA larger than 1 % were filtered out.Additionally, approximately 10 percent of doublets detected by DoubletFinder (McGinnis et al, 2019) were removed.After clustering analysis, we annotated cell clusters using the information of cell type markers accompanied with the snRNA-seq dataset.The analyses mentioned above were all performed with Seurat 3.2.2(Satija et al, 2015).

Cell type inference of slide-seq spots
We used RCTD (Cable et al, 2021) to identify the cell types of spots in Slide-seq.To run RCTD, we trained RCTD with scRNA-seq or snRNAseq datasets with annotated cell types.For the embryo, we used a scRNA-seq dataset (Cao et al, 2019a;Data ref: Cao et al, 2019b) at E12.5 equivalent to the developmental stage of Slide-seq embryo.It consists of 26,183 genes and 270,197 cells assigned into 37 cell types (Appendix Fig S8A).For the hippocampus, we obtained a scRNA-seq dataset (Saunders et al, 2018a;Data ref: Saunders et al, 2018b) from DropViz.It is composed of 27,953 genes and 113,507 cells assigned into 17 cell types (Appendix Fig S8B).For the liver cancer data, we used the paired snRNA-seq dataset (Zhao et al, 2022a;Data ref: Zhao et al, 2022b) given with the Slide-seq data from mouse liver cancer.The pre-processed snRNA-seq dataset consists of 24,098 genes and 11,683 nuclei assigned into 14 cell types (Appendix Fig S8C).Training RCTD, we predicted the cell types of spots in Slide-seq.The simulation was performed under doublet mode (Cable et al, 2021) that constraints each spot to contain up to two cell types, which is recommended for data with fine resolution such as Slide-seq.Using RCTD, we identified the cell types of spots and further estimated the cell type proportions for each spot.
The results on the inferred cell types were additionally validated through correlation analysis.We examined correlation between the true heterotypic spots annotated by RCTD and artificial heterotypic spots.Specifically, we generated artificial heterotypic spots by combining two homotypic spots based on the cell type proportions of heterotypic spots obtained by RCTD (Appendix Fig S1).
With repeated random sampling, we created 100 artificial heterotypic spots for each heterotypic spot.Next, we calculated Pearson's correlation coefficients based on the gene expression values between the true heterotypic spots and many combinatorial types of artificial heterotypic spots (Appendix Fig S2).If the cell types of true heterotypic spots are consistent with those of artificial ones with the largest Pearson's coefficient, we regarded the cell types of the true heterotypic spots as validated.On top of the correlation analysis, the inferred cell types were validated by cell type markers (Dataset EV4) accompanied with the scRNA-seq (Cao et

CellNeighborEX-Neighbor-dependent gene expression analysis
We studied neighbor-dependent gene expression by comparing heterotypic groups (heterotypic neighbors for seqFISH, heterotypic spots for Slide-seq) with homotypic groups (homotypic neighbors for seqFISH, homotypic spots for Slide-seq).We identified genes upand down-regulated by direct cell contact.We carried out rigorous statistical analysis between the two groups.We determined whether to use parametric or non-parametric two-sided tests depending on the sample size of groups.Additionally, the normality test using the Shapiro-Wilk test is also available in CellNeighborEX.When both samples were larger than sample size 30, we chose parametric tests under the normality assumption.To be specific, we conducted the Student's t-test for equal variances and the Welch's t-test for unequal variances, where the two-sample F-test was used to test whether the variances are equal or not.
For the Student's t-test, the t statistic is calculated as follows: where X i is the mean of expression values in group i (i = 1,2).In the equations, group 1 and group 2 represent heterotypic group and homotypic group, respectively.n i is the sample size of group i. s p is the pooled standard deviation of the two groups: s where s i is the standard deviation of group i.
For the Welch's t-test, the t statistic is computed as follows: Meanwhile, when the sample size of at least one sample was smaller than 30, we performed the Mann-Whitney U test as a nonparametric test.The U statistic is calculated as follows: where R 1 is the sum of the ranks for the heterotypic group, and R 2 is the sum of the ranks for the homotypic group.
In DE analysis, the log ratio > 0.4 and P-value < 0.01 were used as criteria for differential expression.For the seqFISH data, FDR < 0.05 was added as an additional criterion.

CellNeighborEX-Verification of cell-cell interactions in the heterotypic spots of slide-seq data
We developed a null model to verify that individual heterotypic spots represent two different cell types interacting with each other.Our null model refers to artificial heterotypic spots (Appendix Fig S1).In contrast with the true heterotypic spots, the artificial heterotypic spots indicate two different cell types just combined without cell-cell interactions.We compared the true heterotypic spots with the artificial heterotypic spots to confirm the statistical significance of the neighbor-dependent genes.The significant neighbor-dependent genes mean that their expression resulted from interacting two cell types.The same statistical tests as the neighbor-dependent gene expression analysis were applied.The log ratio > 0.4, P-value < 0.01, and FDR < 0.01 were used as criteria for differential expression.
Finding cell types expressing neighbor-dependent genes in the heterotypic spots of slide-seq Heterotypic spots of Slide-seq represent interacting two different cell types.For the genes up-regulated by cell contact, it is challenging to find from which cell type the expression of the neighbor-dependent genes comes between the two cell types.We created linear regression models to find the origin of the expression.For instance, we suppose that g is a neighbordependent gene found from the heterotypic spots of A þ B. For n data pairs x i ; y i ð Þ; i ¼ 1; 2; . . .; n f g , the regression model is as follows: where n is the number of A þ B heterotypic spots expressing g. x i is the proportion of A, 1Àx i is the proportion of B, and y i is the expression value of g in heterotypic spot A þ B i .We use the ordinary least squares method to find the intercept (b α) and slope ( b β) illustrating the best fit line to the n data pairs.Estimated b α and b β are computed as follows: where x and y are the mean of the x i and y i , respectively.If slope b β is positive, it indicates that the expression value of g increases as the proportion of cell type A grows.From this, we can infer that the expression of g comes from cell type A. If b β is negative, the expression of g decreases as the proportion of cell type A becomes larger.It means that the expression of g increases as the proportion of cell type B grows.That is, the expression of g comes from cell type B.
We applied the regression models to the neighbor-dependent genes obtained in Slide-seq data when the number of the heterotypic spots is large enough.We additionally used scRNA-seq or snRNAseq data (Appendix Fig S9) and found that the cell types inferred from the single cell or single nucleus data are considerably consistent with the cell types predicted from the statistically significant regression models (i.e., P-value < 0.05; Appendix Fig S3 and  Dataset EV5).

Spatial visualization with RGB color channels
We used RGB coordinates (r, g, b) composed of values between 0 and 1 to observe how gene expression varies depending on neighboring cell types.We first selected one cell type and then collected multiple heterotypic groups where the selected cell type is included.For example, if the cell type of interest is A, heterotypic groups A þ B, A þ C, and A þ D are collected.If gene γ, gene δ, and gene ρ are neighbor-dependent genes found from the heterotypic spots of A þ B, A þ C, and A þ D respectively, red, green, and blue channels are assigned to the three neighbor-dependent genes: gene γ is red (R), gene δ is green (G), and gene ρ is blue (B).To normalize the expression values between 0 and 1, we divided them by the maximum expression value in each heterotypic group.In case that there are two genes, the value of the blue channel is fixed as zero.The expression of the three neighbordependent genes is simultaneously visualized by the RGB channels.

Identification of ligands, receptors, and downstream targets
We used NicheNet (Browaeys et al, 2020) to compare our neighbordependent genes with already known ligands, receptors, and their target genes.We ran NicheNet on the seqFISH and Slide-seq data to see if NicheNet can detect our neighbor-dependent genes.For the seqFISH, we investigated cells corresponding to the heterotypic neighbors (22 types of heterotypic neighbors in the embryo) where our neighbor-dependent genes were identified.We set the centered cell type as a sender and the neighboring cell type as a receiver, vice versa.In the case of Slide-seq, we used homotypic spots corresponding to the heterotypic pairs (9 heterotypic pairs in the embryo, 10 heterotypic pairs in the liver cancer, and 21 heterotypic pairs in the hippocampus).We set the respective homotypic spots as a sender and a receiver by turns.
To validate if the heterotypic spots in Slide-seq are useful to study cell-cell interactions, we examined the heterotypic spots.We set the heterotypic spots as a receiver as well as a sender (autocrine mode in NicheNet).Highly expressed ligand-receptor-target genes detected by NicheNet mean being expressed in at least 10% of cells in one cluster.

Flow cytometry and cell sorting of astrocyte and endothelial tip populations
We used C57BL/6 male mice obtained from Envigo.All animal studies were conducted using a protocol approved by the Virginia Commonwealth University Institutional Animal Care and Use Committee.Cells from the male C57BL/6 mouse hippocampus were isolated and incubated with cell surface antibodies specific to Endothelial Tip (EnT) and Astrocyte cell populations.Following the incubation, these populations were sorted using Cytek Aurora Cell Sorter, and data were analyzed by Spectraflo version 1.0.5 software as follows.FSC/SSC gates were used to define a homogeneous population, and FSC-H/FSC-A gates were used to sort singlets exclusively.For the purpose of isolating the EnT cells, the hippocampus cells were suspended in FACS buffer 2 (PBS, 1 mM EDTA, 25 mM HEPES pH 7, 2% FBS) and stained with CD34: FITC (BD Biosciences #562608), VEGFR2: PE (BD Biosciences #121905), and VEGFR1:Alexa594 (Novus Biological #NB100-527AF594).The VEGFR1-negative cells were selected using the SSC/VEGFR1 gate as described previously (Suchting et al, 2007), from which CD34/VEGFR2 (Siemerink et al, 2013) gate was used to select double-positive populations.For the purpose of isolating the Astrocyte cells, the hippocampus cells were suspended in FACS buffer 2 and stained with ACSA2: APC (Miltenyi Biotec #130-117-535) and CD11b: BUV395 (BD Biosciences #565976).Here, the SSC/CD11b gate was used to select CD11b negative (Pan & Wan, 2020) cells.Then we used the SSC/ACSA2 to specifically isolate Astrocyte cells by selecting the ACSA2 positive population (Pan & Wan, 2020).Isolated EnT and Astrocyte cells were maintained in culture using the Human Endothelial Serum Free Medium and Astrocytes Medium (Thermo Fisher) at 37°C, 5% CO2 incubator.A subpopulation of the Astrocyte cells was infected with 109 TU/ml of GFP comet-pD2109-CMV lentiviral particles (ATUM) and 5 μg/ml of polybrene.After 24 h infection, GFP-positive A cells were selected by GFP expression using Cytek Aurora Cell Sorter.Following a 48 h co-culture of GFP+ Astrocyte and GFP-EnT cells in a 1:1 ratio, the Cytek Aurora Cell Sorter was used to sort the GFPpositive Astrocyte cells and GFP-negative EnT cells according to their GFP expression using SSC/GFP gate.Alternatively, we performed a media exchange experiment in which EnT cells were cultured in Astrocyte cells-derived media while Astrocyte cells were cultured in EnT cells-derived media.

Endothelia tip and astrocyte contact-specific gene quantification by RT-qPCR
We isolated total RNA from EnT and Astrocyte cells individual cultures, cocultures, and cell exchange cultures using the PureLink RNA Mini kit, as per the manufacturer's instruction, and eluted total RNA in 50 μl RNase/DNase-free H2O.Then, we reverse-transcribed to cDNA 10 ng of total RNA using Superscript Vilo cDNA synthesis kit.Finally, we performed real-time PCR (qPCR) in QuantStudioTM 5 (Applied Biosystems) using PowerUp SYBR Green master mix (Thermo Fisher Scientific) and the following reaction conditions.The initial denaturation step was performed at 95°C for 2 min, followed by 40 cycles of 95°C for 15 s and 60°C for 60 s.We used the comparative C T method (ΔΔC t ) to quantify relative gene expression, normalizing the expression of our target genes with the housekeeping gene Gapdh.All samples were run using the following primers: Gapdh: 5 0 -CATCACTGCCACCCAGAAGACTG-3 0 (F) and 5 0 -ATGCCAGTGAGCTT CCCGTTCAG-3 0 (R); Fabp7: 5 0 -TGGGAAACGTGACCAAACCA-3 0 (F) and 5 0 -AGCTTGTCTCCATCCAACCG-3 0 (R); Trf: 5 0 -AGACTTCGAGTT GCTCTGCC-3 0 (F) and 5 0 -CAGAAATTGCCGGTGCAGTC-3 0 (R); and Dbi: 5 0 -TGCGCTCTGTGACTTGATTG-3 0 (F) and 5 0 -ATCGCCCACAGT AGCTTGTTT-3 0 (R).All genes were analyzed using 5 biological replicates.We conducted Shapiro-Wilk test to determine if the gene expression values followed a normal distribution and then performed the Student's t-test for equal variances and the Welch's t-test for unequal variances, where the two-sample F-test was used to test whether the variances are equal or not.The statistical analysis for gene expression was performed with a confidence interval of 95%.

Analyzed publicly available datasets
• Pre-processed subset seqFISH data in a mouse embryo: Squidpy (https://squidpy.readthedocs.io/en/stable/api/squidpy.datasets.seqfish.html).A The heterotypic spots of Endothelial and Lens cells (En + L) (n = 120) were compared with the respective homotypic spots (En [n = 85], L [=52]; log ratio > 0.4, Pvalue < 0.01) and additionally with the artificial heterotypic spots (FDR < 0.01).The statistical test was chosen among the two tailed Student's t-test, Welch's t-test, or Wilcoxon rank sum test depending on the sample size and heterogeneity of variance test.The volcano plot displays 17 up-regulated genes including Cd24a.In the heatmap, the genes are more highly expressed in En + L. The heterotypic spots also express both En and L markers.B The spatial visualization shows the higher expression level of Cd24a in En + L. C In the mouse embryo scRNA-seq data, it was confirmed that Cd24a is expressed in En.D In GO analysis, the GO terms for the genes up-regulated by neighbors are associated with embryonic development.The bar plot shows top 20 GO terms.

Hyobin Kim et al
Molecular Systems Biology A Model depicting the experimental methodology applied to analyze the expression of the predicted neighbor-dependent genes in the mouse hippocampus upon the exchange of cell-derived medias.Briefly, the isolated Astrocyte and Endothelial Tip (EnT) cells from mouse hippocampus were cultured separately for 72 h.Then, the media derived from Astrocyte was transferred to culture EnT cells, while the media from EnT cells was used to culture Astrocyte for 48 h.The cells were harvested and used to perform RT-qPCR.This model validates the expression of the predicted neighbor-dependent genes Trf, Fabp7, and Dbi.B The analysis of qPCR-based mRNA expression of Astrocyte (red bars) and EnT cells (yellow bars) upon the media exchange shows no statistically significant differences between the expression of the predicted neighbor-dependent genes in Astrocyte and EnT cells before and after the media exchange.The expression of these genes in total mouse hippocampus represents the control (green bars).The expression of the predicted neighbor-dependent genes was normalized for the expression of these genes in total mouse hippocampus (N = 5; bars represent averaged fold changes of gene expression relative to the control; error bars indicate mean AE SE).The expression of all genes follows a normal distribution, which was calculated by the Shapiro-Wilk test and the F-test was performed to study the equality of variances.The P-values shown in the bar plot were obtained by performing the two-tailed Student's t-test with a confidence interval of 95%.
Source data are available online for this figure.

▸Figure 2 .
Figure 2. Neighbor dependent-genes identified by CellNeighborEX in mouse hippocampus Slide-seq data.A The heterotypic spots of Endothelial tip and Astrocyte cells (EnT + A) (n = 121) were compared with the respective homotypic spots (EnT [n = 60], Astrocyte[n = 2,909]; log ratio > 0.4, P-value < 0.01) and additionally with the artificial heterotypic spots (FDR < 0.01).The statistical test was chosen among the two tailed Student's t-test, Welch's t-test, or Wilcoxon rank sum test depending on the sample size and heterogeneity of variance test.The volcano plot displays 3 up-regulated genes including Fabp7.In the heatmap, the genes are more highly expressed in EnT + A. The heterotypic spots also express both EnT and Astrocyte markers.B The spatial visualization shows the higher expression level of Fabp7 in EnT + A. C In the mouse hippocampus scRNA-seq data, Fabp7 is mostly expressed in Astrocyte.D In GO analysis, the GO terms are related to neuronal regulation.The bar plot presents top 20 GO terms.

Figure 4 .
Figure 4. Niche-specific gene expression in seqFISH.A The cells of the seqFISH data consist of 21 cell types except for Low quality (i.e., unidentified cells due to low quality).B The spatial mapping with RGB channels simultaneously visualizes the expression of three neighbor-dependent genes.Gut tube cells express Tbx1 (red) when adjacent to Cranial mesoderm, Pitx1 (green) when adjacent to Neural crest, and Foxf1 (blue) when adjacent to Splanchnic mesoderm.

Figure EV2 .
Figure EV2.Transcriptomic change due to direct cell contact in mouse embryo Slide-seq data.

▸Figure EV5 .
Figure EV5.Heterogeneity of endothelial tip (EnT) cells in mouse hippocampus Slide-seq data.A Neighboring cell type-dependent gene expression of EnT cells.EnT cells dominantly express Igfbp7 (red) when proximal to Choroid (EnT + Ch), Trf (green) when proximal to Astrocyte (EnT + A), and Plp1 (blue) when proximal to Interneuron (EnT + In).B Expression of neighbor-dependent genes in the mouse hippocampus scRNA-seq data.It confirms that the three genes are expressed from EnT. C UMAP of EnT cells.4 clusters were obtained through clustering analysis: Cluster 0 to 3. Igfbp7 is mostly expressed in Cluster 3, Trf in Cluster 0, Plp1 in Cluster 2, and none of them is expressed in Cluster 1. D Heterogeneity of EnT cells explained by niche-specific gene expression.Cluster 3 is EnT cells adjacent to Ch, Cluster 0 is EnT adjacent to A, Cluster 2 is EnT adjacent to In, and Cluster 0 is EnT adjacent to another EnT.
For instance, we found 694 Gut tube cells surrounded by the same cell type (homotypic neighbors) and 33 Gut tube cells adjacent to Neural crest cells among heterotypic neighbors.CellNeighborEX found that Gut tube cells express Pitx1 when adjacent to Neural crest cells (P-value < 0.01 and FDR < 0.05; Fig EV1A).To confirm this result, we visualized the expression of Pitx1 for the Gut tube cells together with the Neural crest cells.The Gut tube cells adjacent to Neural crest (Gut tube/Neural crest) are represented in red boundaries, while other Gut tube cells (Gut tube/Gut tube) are represented in blue boundaries.The Neural crest cells are shown in black boundaries.The rest of the cells are shown in gray , Cell-NeighborEX categorized cells based on neighboring cell types.After the DE analysis, CellNeighborEX detected 354 up-regulated genes from 22 types of heterotypic neighbors and 429 down-regulated genes from 22 types of heterotypic neighbors (Dataset EV1A and B).
Appendix Fig S4 and Dataset EV2).To see if the interactions are valid, we calculated minimum distances between the two interacting cell types using the spatial coordinates of the datasets, and obtained its distribution.The estimated distances ranged from 60 to 1,600 μm on average(Appendix Fig S5).This suggests that the interactions identified by NicheNet may include many false predictions.NicheNet only finds frequently interacting cell types based on the averaged gene expression and it does not examine interactions between individual cells.
(Browaeys et al, 2020) al, 2022) induction of de novo gene expression results from the communication established by EnT and Astrocyte.Neighbor-dependent genes are new potential genes involved in cell-cell interactions Ligand-receptor co-expression has been used to study cell-cell interactions in ST data (preprint: Pham et al, 2020; Garcia-Alonso et al, 2021;Li et al, 2021;Shao et al, 2022).To see if the use of ligand-receptor pairs and the downstream genes mediated by them can recover our genes detected by CellNeighborEX, we ran NicheNet(Browaeys et al, 2020)on the seqFISH and Slide-seq datasets (see Materials and Methods).NicheNet detected ligand-receptor pairs and their target genes on each dataset based on prior knowledge of signaling and gene regulatory networks.We found 174, 1, 12, and 11 genes commonly detected between the two approaches, respectively (

table Reagent
Figure EV1.Neighbor dependent-genes identified by CellNeighborEX in mouse embryo seqFISH data.A Gut tube cells adjacent to Neural crest cells (Gut tube/Neural crest) (n = 694) were compared with Gut tube cells proximal to other Gut tube cells (Gut tube/Gut tube; n = 33; log ratio > 0.4, FDR < 0.05).The statistical test was chosen among the two tailed Student's t-test, Welch's t-test, or Wilcoxon rank sum test depending on the sample size and heterogeneity of variance test.The volcano plot shows 23 up-regulated including Pitx1 and 21 down-regulated genes including Foxa1 in Gut tube/Neural crest.B The spatial visualization displays that Gut tube cells adjacent to Neural crest more highly express Pitx1.C For genes up-regulated by cell contact in the mouse embryo seqFISH data, GO analysis shows that the GO terms are associated with embryonic development.The bar plot presents top 20 GO terms.