| website | https://www.wrh.ox.ac.uk/team/michal-krassowski |
| Open source | @krassowski |
| https://linkedin.com/in/michal-krassowski/ |
| website | https://www.wrh.ox.ac.uk/team/michal-krassowski |
| Open source | @krassowski |
| https://linkedin.com/in/michal-krassowski/ |
If you ever wanted, you can now display and interact with LocusZoom plots directly in Python kernel for Jupyter: https://github.com/krassowski/jupyter-locuszoom
This is on PoC stage so please let me know if you find it useful in which case I will find time to polish the rough edges.
This is neat: CorDiffViz: an R Package for visualizing multi-omics differential correlation networks (https://github.com/sqyu/CorDiffViz)
Especially the demo website: https://diffcornet.github.io/CorDiffViz/demo.html - having an interactive network to play around with different methods is useful to get intuition/didactic purposes (but of course could be misused).
How to quickly obtain amino acid change information for all known variants in a given genomic region if ANNOVAR is an overkill? Well, of course use an API from someone who figured it out already. There is a number of services which can help (Entrez, Biomart, ALFA, UCSC).
I've added an example on how to do that from Python using new version of easy-entrez client for the Entrez API: https://github.com/krassowski/easy-entrez#obtaining-amino-acids-change-information-for-variants-in-given-range
Retrieve PubMed articles, text-mining annotations, or molecular data from >35 Entrez databases via easy to use Python package - built on top of Entrez E-utilities API. - GitHub - krassowski/easy...
I don't know what is more annoying: the fact that many articles are still being published behind a paywall on Wiley Online Library, or that we cannot use university credentials to get past the paywall them because the website has silly JavaScript errors like `Uncaught ReferenceError: thiss is not defined` (literally a typo dangling another day!). As if the system was designed to withheld knowledge!
GPTchat knows some things about Mendelian Randomisation.
While it easily produces nonsensical answers after being told the previous answer was wrong, the default answers are often correct. I expected the standard three assumptions but
Background Pathway enrichment extensively used in the analysis of Omics data for gaining biological insights into the functional roles of pre-defined subsets of genes, proteins and metabolites. A large number of methods have been proposed in the literature for this task. The vast majority of these methods use as input expression levels of the biomolecules under study together with their membership in pathways of interest. The latest generation of pathway enrichment methods also leverages information on the topology of the underlying pathways, which as evidence from their evaluation reveals, lead to improved sensitivity and specificity. Nevertheless, a systematic empirical comparison of such methods is still lacking, making selection of the most suitable method for a specific experimental setting challenging. This comparative study of nine network-based methods for pathway enrichment analysis aims to provide a systematic evaluation of their performance based on three real data sets with different number of features (genes/metabolites) and number of samples. Results The findings highlight both methodological and empirical differences across the nine methods. In particular, certain methods assess pathway enrichment due to differences both across expression levels and in the strength of the interconnectedness of the members of the pathway, while others only leverage differential expression levels. In the more challenging setting involving a metabolomics data set, the results show that methods that utilize both pieces of information (with NetGSA being a prototypical one) exhibit superior statistical power in detecting pathway enrichment. Conclusion The analysis reveals that a number of methods perform equally well when testing large size pathways, which is the case with genomic data. On the other hand, NetGSA that takes into consideration both differential expression of the biomolecules in the pathway, as well as changes in the topology exhibits a superior performance when testing small size pathways, which is usually the case for metabolomics data.