Carcinomas Miner


, , , ,

Carcinomas Miner | Find resources relevant to the types of cancers known as carcinomas (

Recrawl search results and build your own curated collection of resources. Use this miner to extract valuable front/back end data from relevant sites.

You may also use its rss news channels ( to find news from around the Web relevant to these and other types of cancers.


Bond Order Calculator Tool


, , , ,


This tool computes bond orders of diatomic species having up to 20 electrons, without using Molecular Orbital Theory! It is available at

We developed the tool inspired in Dr. Arijit Das set of innovative and time economic formulae for chemical education. His methodologies are suitable for computer-based learning (CBL) activities or for writing computer programs for solving chemistry problems.

In my opinion, students who know how to write programs for solving chemistry problems have an edge when taking quantitative courses like analytical chemistry, instrumental analysis, chemometrics, computational chemistry, and similar courses. I think they might be better prepared for multidisciplinary research work.

Developing this tool was really gratifying as the work inspired us to derive an algorithm for predicting number of unpaired electrons and magnetic properties of single atoms, diatomic species, and their ions. Hopefully, this algorithm will be available early next year in the form of a new chemistry calculator.

We are sincerely in debt to Dr. Arijit Das from Ramthakur College, Agartala, West Tripura, India for encouraging us to develop this tool for educators, scholars, and chemistry students.

Applications of Binary Fractals to Long Genetic Sequences via a Kronecker Family of Genetic Matrices


, , , , , , ,

CSS Fractal Studio

Back in 2017, Stepanyan & Petoukhov reported that long nucleotide sequences can be modeled as binary fractals by means of Kronecker exponentiation of matrices.

see also

Abstract reads in part:

“This method uses a set of symmetries of biochemical attributes of nucleotides. It also uses the possibility of presentation of every whole set of N-mers as one of the members of a Kronecker family of genetic matrices. With this method, a long nucleotide sequence can be visually represented as an individual fractal-like mosaic or another regular mosaic of binary type.”

We added the fractal resembling the pattern of the nucleotide sequence Homo sapiens chromosome 22 genomic scaffold into our Fractal Studio tool at

Researchers can reproduce its binary mosaic, shown above, by just selecting the Homo Sapiens Mosaic option from the tool selection menu. Compare results with Figures 4 and 8 of Stepanyan & Petoukhov article. Compare also some multifractals that the tool generates with some of the genetic mosaics described in the article.

Multidisciplinary research is a beautiful thing.

On Matrix Determinants, Poems, and Symmetry


, , , ,


Reverend Charles Lutwidge Dodgson, better known by his pseudonym Lewis Carroll as the creator of Alice in Wonderland, proposed in 1866 a new method for computing matrix determinants called Condensation of Determinants.

An elementary proof of Dodgson’s Condensation Method is available from Main, Donor, and Harwood.

Dodgson was a real genious, although with some dark sides Wikipedia.

Anyway, hidden in his poems are some gems of linear algebra. The following lines written by Lewis Carroll read the same horizontally and vertically when cases and nonletter characters are ignored.

I often wondered when I cursed,
Often feared where I would be—
Wondered where she’d yield her love
When I yield, so will she,
I would her will be pitied!
Cursed be love! She pitied me…

See figure. That is an example of a symmetric matrix.

For Math Teachers

Here is a nice homework: Assign integer values to unique words of the poem and compute its determinant with Dodgson’s Method.

Challenge 1: Starting at the top-left corner, read matrix diagonal elements. Explain the meaning of the diagonal message in the Carroll and then in the Dodgson’s sense.

Challenge 2: There are also other hidden messages in that matrix to read. Hint: Move through rows/cols adopting a traveling pattern. Can you find them?

On the Non-Additivity of Correlation Coefficients Part 3: The Bias & Nature of Correlation Coefficients


, , , ,

statistical relationships
This the third and last part of a tutorial series on the non-additivity of correlation coefficients.

Their bias & nature, transformations, and approximations to normality are discussed. The risks of blindly transforming scores to ranks or arbitrarily converting r-to-Z values/Z-to-r values (Fisher Transformations) are discussed. Shifted up cosine approximations to normality are also covered.

Not all researchers know that score-to-rank transformations can change the sampling distribution of a statistic (e.g. a correlation coefficient) and that Fisher transformations are sensitive to normality violations. Combining both types of transformations is a recipe for a statistical disaster.

Alas, some meta analysis and data analytic folks are guilty of that.

Hybrid Similarity Search (HSS) Algorithm for Chemistry Searching


, , ,

Hybrid Similarity Search (HSS) Algorithm for Chemistry Searching for Fentanyl-related compounds and other drugs.
Free version:

This is a news from NIST back in March ( ) and found with the NIST RSS channel of the Chemical Substances Miner

It is a nice example of Information Retrieval applied to Chemistry. They used a modified cosine similarity function. I see possible applications to topic analysis.

Original Source:
Anal. Chem., 2017, 89 (24), pp 13261–13268 DOI: 10.1021/acs.analchem.7b03320

“A mass spectral library search algorithm that identifies compounds that differ from library compounds by a single “inert” structural component is described. This algorithm, the Hybrid Similarity Search, generates a similarity score based on matching both fragment ions and neutral losses. It employs the parameter DeltaMass, defined as the mass difference between query and library compounds, to shift neutral loss peaks in the library spectrum to match corresponding neutral loss peaks in the query spectrum. When the spectra being compared differ by a single structural feature, these matching neutral loss peaks should contain that structural feature. This method extends the scope of the library to include spectra of “nearest-neighbor” compounds that differ from library compounds by a single chemical moiety. Additionally, determination of the structural origin of the shifted peaks can aid in the determination of the chemical structure and fragmentation mechanism of the query compound. A variety of examples are presented, including the identification of designer drugs and chemical derivatives not present in the library.”

C.R. Rao: A Giant & Living Legend of Statistics Among Us


, , , , ,

Curating collections requires going to original sources which is gratifying.

As part of the effort of building a miner on the golden age of Statistics, I researched those from Ronald Fisher times who might still alive. I found one researcher that precisely is Fisher’s only PhD: Calyampudi Radhakrishna Rao, now 98.

I asked Dr. Rao for help in identifying important references and moments from those times. He graciously sent me his CV listing references to all of his glorious books (15), articles (477), and moments.

Even in his retirement he is still publishing:

Dr. Rao also sent me a PDF with historical photos of him with Mahalanobis, Prime Minister Nehru, Prime Minister Indira Gandhi, and others, and of many glorious moments from his career. What an honor!

His work has impacted so many fields that there are several technical terms bearing his name.

Here is an appealing quote from him:

“We study physics to solve problems in physics, chemistry to solve problems in chemistry, and botany to solve problems in botany. There are no statistical problems which we solve using statistics. We use statistics to provide a course of action with minimum risk in all areas of human endeavor under available evidence. — C. R. Rao”

On Mind Retrieval: BrainNet


, , ,

This post is part of a series on Mind Retrieval that we started long time ago back in 2010. Links to previous posts can be found here:

The Internet can be thought of as a network of nodes where each node has an assigned IP address. The Web and Deep Web are subsets of this. Traversing these nodes is done with crawlers. To retrieve the information that flows across them is what we know as information retrieval. The information might consists of data, text, sound, images. The text might convey numbers, words, phrases, topics, ideas, theses…

Now if the nodes are human brains, the information extraction from the part of the brain responsible of thoughts, emotions, senses, can be loosely termed Mind Retrieval.

So Mind Retrieval requires communications between nodes (brains). The BrainNet Project represents one step closer toward this direction. Here is an interesting link ( where three human brains were “connected” to accomplish specific tasks.

Imagine a large scale project where the brightest minds can interact to take on bigger tasks or solve important scientific and practical problems. Imagine artificial brains (bot brains) doing the same.

When I started this series years ago, many did not believe in the idea of mind retrieval, teasing it as mere speculations.

I guess they have been retracted since then. Mind Retrieval and brain-to-brain social networks are at a corner near you, along with peripherals: browsers, reprogramming, adverts, teleports, hacks, etc.

On Men and Ideas: Fisher vs. Pearson


, , ,

Ronald Aylmer Fisher was considered an outsider by the statistical establishment of his time.

The links below (1-3) show his struggles & nuances with Karl Pearson, his son Egon, Bowley, their followers, and the Royal Statistical Society (RSS). His life was a story of accomplishments and noise (deceptions and nasty RSS politics). He was too ahead of his time.

That reminds me of the struggles of another maverick: Benoit Mandelbrot. Eventually and like Mandelbrot, Fisher greatness was recognized. Also like Mandelbrot, he was able to boost the signal-to-noise of his career and life.

Most statisticians consider Fisher the Father of Modern Statistics (, even when he was not allowed to teach Statistics at the University of Cambridge (they tried to silence Fisher).

Yes, scientists too can be demeaning to other scientists, more for personal reasons than for ideas and the Scientific Method. After all, they are also mostly carbon units called “humans”.

1. Fisher in 1921

2. Fisher vs Pearson: A 1935 Exchange from Nature

3. Fisher: The Outsider
R. A. Fisher: how an outsider revolutionized statistics