Skip to content

Commit 3c09bbc

Browse files
committed
- Changed dependency: arboretum package was renamed to arboreto.
- Set version of loompy package to 2.0.2 because setup.py in more recent versions is not compatible with python 3.5.
1 parent 4bd545b commit 3c09bbc

8 files changed

Lines changed: 24 additions & 24 deletions

File tree

README.rst

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -14,13 +14,13 @@ in no time. The latter is achieved via the dask_ framework for distributed compu
1414

1515
The pipeline has three steps:
1616

17-
1. First transcription factors (TFs) and their target genes, together defining a regulon, are derived using gene inference methods which solely rely on correlations between expression of genes across cells. The arboretum_ package is used for this step.
17+
1. First transcription factors (TFs) and their target genes, together defining a regulon, are derived using gene inference methods which solely rely on correlations between expression of genes across cells. The arboreto_ package is used for this step.
1818
2. These regulons are refined by pruning targets that do not have an enrichment for a corresponding motif of the TF effectively separating direct from indirect targets based on the presence of cis-regulatory footprints.
1919
3. Finally, the original cells are differentiated and clustered on the activity of these discovered regulons.
2020

2121

2222
.. note::
23-
The most impactfull speed improvement is introduced by the arboretum_ package in step 1. This package provides an alternative to GENIE3 [3]_ called GRNBoost2. This package can be controlled from within pySCENIC.
23+
The most impactfull speed improvement is introduced by the arboreto_ package in step 1. This package provides an alternative to GENIE3 [3]_ called GRNBoost2. This package can be controlled from within pySCENIC.
2424

2525

2626
.. sidebar:: **Quick Start**
@@ -103,8 +103,8 @@ First we import the necessary modules and declare some constants:
103103
104104
from dask.diagnostics import ProgressBar
105105
106-
from arboretum.utils import load_tf_names
107-
from arboretum.algo import grnboost2
106+
from arboreto.utils import load_tf_names
107+
from arboreto.algo import grnboost2
108108
109109
from pyscenic.rnkdb import FeatherRankingDatabase as RankingDatabase
110110
from pyscenic.utils import modules_from_adjacencies, load_motifs
@@ -172,10 +172,10 @@ Phase I: Inference of co-expression modules
172172
In the initial phase of the pySCENIC pipeline the single cell expression profiles are used to infer
173173
co-expression modules from.
174174

175-
Run GENIE3 or GRNBoost from arboretum_ to infer co-expression modules
175+
Run GENIE3 or GRNBoost from arboreto_ to infer co-expression modules
176176
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
177177

178-
The arboretum package is used for this phase of the pipeline. For this notebook only a sample of 1,000 cells is used
178+
The arboreto package is used for this phase of the pipeline. For this notebook only a sample of 1,000 cells is used
179179
for the co-expression module inference is used.
180180

181181
.. code-block:: python
@@ -321,7 +321,7 @@ References
321321
.. _distributed: https://distributed.readthedocs.io/en/latest/
322322
.. _LCB: https://aertslab.org
323323
.. _feather: https://github.com/wesm/feather
324-
.. _arboretum: https://arboretum.readthedocs.io
324+
.. _arboreto: https://arboreto.readthedocs.io
325325
.. _notebooks: https://github.com/aertslab/pySCENIC/tree/master/notebooks
326326
.. _issue: https://github.com/aertslab/pySCENIC/issues/new
327327
.. _SCENIC: http://scenic.aertslab.org

docs/index.rst

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -14,12 +14,12 @@ in no time. The latter is achieved via the dask_ framework for distributed compu
1414

1515
The pipeline has three steps:
1616

17-
1. First transcription factors (TFs) and their target genes, together defining a regulon, are derived using gene inference methods which solely rely on correlations between expression of genes across cells. The arboretum_ package is used for this step.
17+
1. First transcription factors (TFs) and their target genes, together defining a regulon, are derived using gene inference methods which solely rely on correlations between expression of genes across cells. The arboreto_ package is used for this step.
1818
2. These regulons are refined by pruning targets that do not have an enrichment for a corresponding motif of the TF effectively separating direct from indirect targets based on the presence of cis-regulatory footprints.
1919
3. Finally, the original cells are differentiated and clustered on the activity of these discovered regulons.
2020

2121
.. note::
22-
The most impactfull speed improvement is introduced by the arboretum_ package in step 1. This package provides an alternative to GENIE3 [3]_ called GRNBoost2. This package can be controlled from within pySCENIC.
22+
The most impactfull speed improvement is introduced by the arboreto_ package in step 1. This package provides an alternative to GENIE3 [3]_ called GRNBoost2. This package can be controlled from within pySCENIC.
2323

2424
.. sidebar:: **Quick Start**
2525

@@ -97,8 +97,8 @@ First we import the necessary modules and declare some constants:
9797
9898
from dask.diagnostics import ProgressBar
9999
100-
from arboretum.utils import load_tf_names
101-
from arboretum.algo import grnboost2
100+
from arboreto.utils import load_tf_names
101+
from arboreto.algo import grnboost2
102102
103103
from pyscenic.rnkdb import FeatherRankingDatabase as RankingDatabase
104104
from pyscenic.utils import modules_from_adjacencies, load_motifs
@@ -166,10 +166,10 @@ Phase I: Inference of co-expression modules
166166
In the initial phase of the pySCENIC pipeline the single cell expression profiles are used to infer
167167
co-expression modules from.
168168

169-
Run GENIE3 or GRNBoost from arboretum_ to infer co-expression modules
169+
Run GENIE3 or GRNBoost from arboreto_ to infer co-expression modules
170170
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
171171

172-
The arboretum package is used for this phase of the pipeline. For this notebook only a sample of 1,000 cells is used
172+
The arboreto package is used for this phase of the pipeline. For this notebook only a sample of 1,000 cells is used
173173
for the co-expression module inference is used.
174174

175175
.. code-block:: python
@@ -315,7 +315,7 @@ References
315315
.. _distributed: https://distributed.readthedocs.io/en/latest/
316316
.. _LCB: https://aertslab.org
317317
.. _feather: https://github.com/wesm/feather
318-
.. _arboretum: https://arboretum.readthedocs.io
318+
.. _arboreto: https://arboreto.readthedocs.io
319319
.. _notebooks: https://github.com/aertslab/pySCENIC/tree/master/notebooks
320320
.. _issue: https://github.com/aertslab/pySCENIC/issues/new
321321
.. _SCENIC: http://scenic.aertslab.org

notebooks/pySCENIC - Full pipeline.ipynb

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,8 +26,8 @@
2626
"import os, glob\n",
2727
"import pickle\n",
2828
"\n",
29-
"from arboretum.utils import load_tf_names\n",
30-
"from arboretum.algo import grnboost2\n",
29+
"from arboreto.utils import load_tf_names\n",
30+
"from arboreto.algo import grnboost2\n",
3131
"\n",
3232
"from pyscenic.rnkdb import FeatherRankingDatabase as RankingDatabase\n",
3333
"from pyscenic.utils import modules_from_adjacencies\n",

requirements.dev.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
cytoolz
2-
arboretum
2+
arboreto
33
multiprocessing_on_dill
44
numba==0.37.0
55
llvmlite==0.22.0

requirements.txt

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -9,16 +9,15 @@ pandas>=0.20.1
99
cloudpickle
1010
dask>=0.17.2
1111
distributed>=1.21.6
12-
arboretum
1312
pyarrow==0.8.0
1413
feather-format
15-
arboretum
14+
arboreto
1615
boltons
1716
setuptools
1817
pyyaml
1918
tqdm
2019
interlap
2120
umap-learn
22-
loompy
21+
loompy==2.0.2
2322
networkx
2423
matplotlib

scripts/hpc-grnboost.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,8 @@
44
import os, sys, glob
55
import datetime
66
from configparser import ConfigParser
7-
from arboretum.algo import grnboost2
8-
from arboretum.utils import load_tf_names
7+
from arboreto.algo import grnboost2
8+
from arboreto.utils import load_tf_names
99
from dask.distributed import LocalCluster, Client
1010
#from cytoolz import mapcat
1111
import logging

src/pyscenic/cli/pyscenic.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,8 @@
55
import logging
66
from dask.diagnostics import ProgressBar
77
from multiprocessing import cpu_count
8-
from arboretum.algo import grnboost2
9-
from arboretum.utils import load_tf_names
8+
from arboreto.algo import grnboost2
9+
from arboreto.utils import load_tf_names
1010

1111
from pyscenic.utils import load_from_yaml, modules_from_adjacencies
1212
from pyscenic.rnkdb import opendb, RankingDatabase

src/pyscenic/prune.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -248,6 +248,7 @@ def wrap(data):
248248
#def memoize(db: Type[RankingDatabase]) -> Type[RankingDatabase]:
249249
# return MemoryDecorator(db)
250250
#delayed_or_future_dbs = list(map(wrap, map(memoize, rnkdbs)))
251+
# Check also latest Stackoverflow message: https://stackoverflow.com/questions/50795901/dask-scatter-broadcast-a-list
251252
delayed_or_future_dbs = list(map(wrap, rnkdbs))
252253
# 3. The gene signatures: these signatures become large when chunking them, therefore chunking is overruled
253254
# when using dask.distributed.

0 commit comments

Comments
 (0)