Recursive Mini Converters


Recursive Mini Converters is our newest tool.

This tool is a proof of concept for the notion of recursive mini converters. It is available at

If a single-line text field is used for input and output it can be called a two-way field. If this is the only text field that is visible at the front-end of a form, we may call said form a two-way mini form. If this form processes input and output by means of conversion factors, then said form defines a mini converter. If the output becomes the new input, a recursive mini converter is obtained.

With that in mind, we show that recursive mini converters:

—can be coded using pure PHP; i.e., no framework, JQuery, or JavaScript are needed.
—are suitable for small screen displays like smartphones and tablets.
—add non-intrusive unit conversion capabilities to other web tools when placed on same page.
—can coexist on a web page, independent from each other, and without collisions.

It remains to show that multiple mini converters can be incorporated into a single form, but this is not challenging.

Significant Figures Calculator


, , , ,

This tool, The Significant Figures Calculator, lets users compute and edit significant figures.

The tool is based on the theory of relative errors and reports results in conventional and scientific notation. Values involving ambiguous trailing zeros are properly counted.

A handy tool for teachers and students, and for those that need to report quantities to a given number of significant figures.

Giving a precise definition for the correct number of significant figures is quite subtle (Higham, 2002). Different rules for counting significant figures have been reported. There is also the fact that not all significant figures are meaningful figures.

The theory of relative errors can be used to derive safe counting guidelines. See Numerical Analysis, by Burden and Faires, pages 20-22. According to the theory, it can be stated that significant figures can be attributed only to measured quantities for which a relative error can be computed. Thus it is not possible to attribute significant figures to zero as a quantity, exact numbers, conversion factors, and non-measured constants. However, all digits of a measured constant are significant.

Although there are many similar calculators out there, many do not pass what can be called “the test of zeros”. That is, enter one or more zeros as a quantity (0, 0.0, 0.00,…) and check the result. According to the theory of relative errors, no significant figures attribution is possible. Expressing these in scientific notation does not add any artificial significance to them as we still cannot compute relative errors for them.

Let us finally address the question of how many significant figures are in “0.”, i.e., in a zero followed by a decimal point. Some authors claim that this zero has one significant figure. Their argument here is that the convention of trailing zeros ending with a decimal place applies. This implies that said zero is trailing itself. This is an invalid argument as we can also imply that said zero is a leading zero, leading itself. Again, no relative error can be computed for a self-trailing or self-leading zero.

Last updated: 5-23-2021

The Video Finder


, , ,

I always wanted to have an easy way of accessing news videos from national TV channels, without having to subscribe to them or pay for third-party tools that do this. So I built my own one: The Video Finder (

Users can submit a YouTube channel id or a url containing one and the tool returns a links list in feeds format. Videos on other categories or topics can be accessed with the tool. The tool is based on a previous one (The RAR Parser,

Now I have a way of building web pages on a given category (Human Resources, Chemistry, Genetics, Health, Sports,…), listing relevant channels. Great for educational or research purposes.

Fast reverse complement computation of DNA sequences without string concatenation loops


, ,

Arguments on why string concatenation loops (SCL) is a bad programming practice were given in a previous post.

You can also google for additional arguments against SCL.

We now show a simple PHP code that makes unnecessary SCL when computing DNA reverse complement sequences. Consider the sequence


This sequence corresponds to the first 21 nucleotides of the NCBI Reference Sequence NC_045512.2; i.e., to the first 21 nts of the complete genome of the severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1 ( ).

To compute its reverse complement sequence,


we just need to use the following one-liner:

echo strrev(strtr(‘ATTAAAGGTTTATACCTTCCC’, ‘ACGT’, ‘TGCA’));


No string splitting, concatenation, or looping are needed!

As I always say, graduate students and postdocs who can code have and edge in multidisciplinary research work over those that cannot code.

Why string concatenation in loops is a bad programming strategy.


, , , , ,

I’m finally back after a long vacation, thanks to the pandemic. Having said that, let’s go back to business.

Why string concatenation in loops (SCL) is a bad programming strategy.

String concatenation in loops (SCL), is often used in sequencing analysis; for instance, for finding complement and reverse complement sequences, for DNA to RNA translations, and so forth. It is also used for generating random sequences, and DNA data storing.

I’m currently writing an article on this subject. A little snippet follows:

“SCL is a common strategy because of its simplicity. In many programming languages, however, SCL is discouraged because generates a large number of temporary objects, consumes memory resources, and increases execution times. As sequences get larger, or if these are processed in batch mode, implementing SCL becomes a computationally inefficient strategy. “

I cannot wait to finish it and provide one-liner solutions, applicable across knowledge domains.

A search in Google for string concatenation in loops provides some reasons for avoiding SCL.

On Fake Research, “Academic” Spam, and ASEO


, , , ,

Oldie but goodie. Little has changed since then as can be seen from the comparative below.


Academic Search Engine Spam and Google Scholar’s Resilience Against it;rgn=main

What an Audacious Hoax Reveals About Academia
Three scholars wrote 20 fake papers using fashionable jargon to argue for ridiculous conclusions.


CSEO = Covid SEO

Covid Spam/Scam.

The Almost Binary Heuristic


, , ,

Yet another bond order calculation heuristic that still fail.

I describe this new heuristic, The Almost Binary Heuristic, in the “What is computed?” section of the bond order calculator at

The tool itself is described in the Bond Order Calculator Tool post at

The Almost Binary Heuristic is aimed at computing bond orders of diatomic species having up to 20 electrons in a straightforward manner. The heuristic can be used to reproduce the results of our bond order calculator tool.

I’ve included the php script that generates the so-called “phone number” trick for computing bond orders of diatomic species with up to 20 electrons.

Feel free to copy/rewrite the code with your favorite programming language or use it to build your own bond order calculator tool. Just please keep the credit lines in place. 🙂


Vector Space Explorer Tool


, , , , , ,

Vector Space Explorer Tool is a new tool from Minerazzi, available now at

VSE is aimed at exploring combinations of local (e.g., FREQ, ATF1,…) and global (e.g., IDF, IDFP,…) term weighting schemes for documents and queries. All kind of combinations can be easily explored.

The tool lists results in decreasing order of cosine similarity scores, with or without implementing stopwords removal and parametric corrections.

VSE was developed for computer science students and those interested in information retrieval systems so they can learn how IR systems work.

First-time users may want to try the examples provided by pressing the Try This Example button from the tool. It is possible to cycle through the examples by repetitive pressing this button. Some of the examples list top titles obtained by querying commercial search engines.

Accepting the default settings instructs VSE to remove stopwords and implement the FREQ Model, also known as the Term Count Model. This is one of the simplest vector space implementations where term weights are mere raw frequencies (term counts).

Revamping the Cosine Similarity Calculator Tool


, , ,

One of the most interesting problems in data mining and cluster analysis relates to the transformation of similarities into distances without breaking the triangular inequality condition for a distance metric.

Some of the transformations found in the literature are based on heuristics and tricks of the trade, or based on assumptions applicable to a given knowledge domain. This topic is discussed in our tutorial on distance and similarity (

We have incorporated to our Cosine Similarity Calculator a simple methodology that easily transforms cosine similarities into distances while obeying the requirements for a distance metric. It all boils down to mean-centering the variables.

Check our revamped and improved Cosine Similarity Calculator tool now at

RAR Parser |An RSS, ATOM, and RDF Parser


, , , , , ,

The RAR Parser is a tool that lets you read RSS, ATOM, and RDF news feeds, without subscribing. It is available at

A practical example: By submitting the MIT Health Sciences and Technology news feeds url, the following news relevant to COVID-19 were obtained, among others. Results might change as time evolves.

MIT scientist helps build Covid-19 resource to address shortage of face mask

MIT initiates mass manufacture of disposable face shields for Covid-19 response

An experimental peptide could block Covid-19

3 Questions: The risks of using 3D printing to make personal protective equipment