Sensospot Data Parser
=====================
Parsing the numerical output from [SensoSpot][sensospot] microarray analysis.

The [SensoSpot][sensospot] microarray analyzer is an automated fluorescence microscope with image analysis software for detecting and measuring microarrays. The product was originally named "FLAIR" and made by the company Sensovation, which was later acquired by Miltenyi.

I am not affiliated with Sensovation or Miltenyi. I just use the product and needed a way to make the data available for further analysis.

## Example:
```python
import sensospot_parser

# read the raw data of a folder
raw_data = sensospot_parser.parse_folder("<path to results directory>")

sorted(raw_data.columns) == [
    'Analysis.Datetime', 'Analysis.Image', 'Analysis.Name',
    'Bkg.Area', 'Bkg.Mean', 'Bkg.Median', 'Bkg.StdDev', 'Bkg.Sum',
    'Exposure.Id',
    'Parameters.Channel', 'Parameters.Time',
    'Pos.Id', 'Pos.Nom.X', 'Pos.Nom.Y', 'Pos.X', 'Pos.Y',
    'Spot.Area', 'Spot.Diameter', 'Spot.Found', 'Spot.Mean', 'Spot.Median',
    'Spot.Saturation', 'Spot.StdDev', 'Spot.Sum',
    'Well.Column', 'Well.Name', 'Well.Row'
]
```
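
Since the result is an ordinary pandas DataFrame, the usual pandas tools apply. As a minimal sketch, the snippet below builds a tiny hand-made frame with a few of the column names listed above and averages `Spot.Mean` per well and exposure. The values are invented for illustration and do not come from a real measurement.

```python
import pandas as pd

# Invented stand-in for the frame returned by parse_folder();
# only a few of the real columns are included.
raw_data = pd.DataFrame(
    {
        "Well.Name": ["A01", "A01", "A01", "A01"],
        "Exposure.Id": [1, 1, 2, 2],
        "Pos.Id": [1, 2, 1, 2],
        "Spot.Mean": [10.0, 20.0, 100.0, 300.0],
    }
)

# Average spot signal per well and exposure
mean_signal = raw_data.groupby(["Well.Name", "Exposure.Id"])["Spot.Mean"].mean()
print(mean_signal)
```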
## Constants
There is a `columns` module available, providing constants that define the column names.
```python
import sensospot_parser
sensospot_parser.columns.ANALYSIS_NAME == "Analysis.Name"
```
## Available public functions:
All public functions return a [pandas DataFrame][pandas] object.
Be aware that some columns might contain no values. This depends on the parsing
method (XML or CSV) and on whether a parameters file could be found.

- **parse_folder(path_to_folder)**
  Tries the `parse_xml_folder()` function first and, if an error occurs,
  falls back to `parse_csv_folder()`.
- **parse_xml_folder(path_to_folder)**
  Searches the folder for a parsable Sensospot XML result file and parses it into
  a pandas data frame. It adds metadata from the parameters folder
  if it is present.
- **parse_csv_folder(path_to_folder)**
  Searches the folder for parsable Sensospot .csv files, parses them into one
  big pandas data frame and adds metadata from the parameters folder
  if it is present.
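
The try-first, fall-back-on-error behaviour described for `parse_folder()` can be sketched as follows. This is only an illustration of the pattern, not the library's code: `parse_with_fallback` and the two stand-in parsers are hypothetical names, and which exception types the real function catches is not stated here, so the sketch simply catches any `Exception`.

```python
def parse_with_fallback(path, primary, fallback):
    """Try `primary(path)`; if it raises, return `fallback(path)` instead."""
    try:
        return primary(path)
    except Exception:  # assumption: any error triggers the fallback
        return fallback(path)


# Hypothetical stand-ins for parse_xml_folder() and parse_csv_folder()
def fake_xml_parser(path):
    raise ValueError(f"no XML result file in {path}")


def fake_csv_parser(path):
    return f"parsed csv files in {path}"


result = parse_with_fallback("some/folder", fake_xml_parser, fake_csv_parser)
print(result)
```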
## CLI
For the (probably) most important function, there is a CLI command:
```sh
Usage: sensospot_parse [OPTIONS] SOURCES

Arguments:
  SOURCES:  One or more folders with Sensospot measurements

Options:
  -o, --output FILE  Output file path, defaults to 'collected_data.csv'
  -q, --quiet        Ignore sanity check for csv file parsing
  --help             Show this message and exit.
```
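
For orientation, the following sketch shows roughly what such a command might do in Python: parse each source folder, concatenate the resulting frames, and write one CSV file. This is an assumption based on the help text, not the actual implementation. `collect` and its `parse` argument are hypothetical names, the CSV layout the real command writes may differ, and a stand-in parser is used so the snippet runs without measurement data. In real use, `sensospot_parser.parse_folder` would be passed as `parse`.

```python
import io

import pandas as pd


def collect(sources, output, parse):
    """Parse every folder in `sources` and write the combined data as CSV."""
    frames = [parse(source) for source in sources]
    collected = pd.concat(frames, ignore_index=True)
    collected.to_csv(output, index=False)
    return collected


# Hypothetical stand-in parser returning one row per folder
def fake_parse(folder):
    return pd.DataFrame({"Analysis.Name": [folder], "Spot.Mean": [1.0]})


buffer = io.StringIO()  # in-memory "file" instead of collected_data.csv
collected = collect(["run_1", "run_2"], buffer, fake_parse)
print(buffer.getvalue())
```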
## Development
To install the development version of Sensospot Data Parser:

    git clone https://git.cpi.imtek.uni-freiburg.de/holgi/sensospot_data.git

    # create a virtual environment and install all required dev dependencies
    cd sensospot_data
    make devenv

To run the tests, use `make tests` (failing on first error) or `make coverage` for a complete report.

To generate the documentation pages, use `make docs`, or use `make serve-docs` to
start a web server with the generated documentation.

[sensospot]: https://www.miltenyi-imaging.com/products/sensospot
[pandas]: https://pandas.pydata.org/docs/reference/frame.html