I have a CSV file with 16 columns and I would like to count the missing values in each column. These are the first rows of the file:
2009-01-09,,,0,,,,700.0,0.0,14,1.0,,,,3010,14
2009-01-10,,,0,,,,3050.0,0.0,61,1.0,,,,13129,61
2009-01-11,,,0,,,,4650.0,0.0,93,1.0,,,,20033,93
2009-01-12,,,7,,,,4700.0,0.0,102,1.0,5,,0.0,22031,94
2009-01-13,,,0,,,,6150.0,0.0,123,1.0,,,,26527,123
2009-01-14,,,1,,,,6450.0,0.0,133,1.0,0,,0.0,28276,129
2009-01-15,,,8,,,,6300.0,0.0,140,1.0,6,,0.0,30061,126
2009-01-16,,,2,,,,5400.0,0.0,114,1.0,0,,0.0,23854,108
2009-01-17,,,0,,,,5450.0,0.0,109,1.0,,,,23528,109
Practically all the references I have found on the internet are about counting the lines of a file, not its missing data.
Is there a way to count the cells that hold null values?
Thanks in advance.
You can do it with `awk`. Taking the rows shown above as input, we can run a short `awk` program that examines every field of every line and counts the empty ones.
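The original program did not survive in this copy of the answer; a sketch that fits the description (count empty fields per column; the file name `data.csv` is my assumption) could be:

```shell
# Recreate the sample rows from the question (file name is arbitrary)
cat > data.csv <<'EOF'
2009-01-09,,,0,,,,700.0,0.0,14,1.0,,,,3010,14
2009-01-10,,,0,,,,3050.0,0.0,61,1.0,,,,13129,61
2009-01-11,,,0,,,,4650.0,0.0,93,1.0,,,,20033,93
2009-01-12,,,7,,,,4700.0,0.0,102,1.0,5,,0.0,22031,94
2009-01-13,,,0,,,,6150.0,0.0,123,1.0,,,,26527,123
2009-01-14,,,1,,,,6450.0,0.0,133,1.0,0,,0.0,28276,129
2009-01-15,,,8,,,,6300.0,0.0,140,1.0,6,,0.0,30061,126
2009-01-16,,,2,,,,5400.0,0.0,114,1.0,0,,0.0,23854,108
2009-01-17,,,0,,,,5450.0,0.0,109,1.0,,,,23528,109
EOF

# Count the empty fields in each column; -F , splits every line on commas
awk -F , '
{
    for (i = 1; i <= NF; i++)   # NF = number of fields in this row
        if ($i == "") miss[i]++
    if (NF > cols) cols = NF    # remember the widest row seen
}
END {
    for (i = 1; i <= cols; i++)
        printf "column %d: %d empty\n", i, miss[i] + 0
}' data.csv
```

With the nine rows above this reports 9 empty cells for columns 2, 3, 5, 6, 7 and 13, 5 for columns 12 and 14, and 0 for the rest (64 in total).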
In this script, for each row, we check each field for empty values; with `-F ,` we tell `awk` that the fields are delimited by commas.

Another option is `grep`, which does not count the empty fields but does show them. There is no point in pasting the output here, because it depends on the color `grep` applies with the `--color` parameter: the one-liner prints whole lines, but with the empty fields between commas highlighted in red (it would be worth asking whether this highlighting can only be had with `grep`
).

Another option that occurs to me, which seems somewhat inefficient (which is why I leave it for last), is something like this:
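The command itself was lost in this copy; an inefficient line-by-line loop that matches the description (one process spawned and one output produced per row; `data.csv` again stands for the file with the rows from the question) might be:

```shell
# Hypothetical reconstruction: for every row, print how many of its fields are empty.
# Spawning one awk per line is what makes this approach inefficient.
while IFS= read -r line; do
    printf '%s\n' "$line" |
        awk -F , '{ n = 0; for (i = 1; i <= NF; i++) if ($i == "") n++; print n " empty field(s)" }'
done < data.csv
```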
In this case, each row will generate one line of output.
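Going back to the `grep` option: the exact one-liner also did not survive in this copy, but something in this spirit would match the description (my guess, not necessarily the original; `--color` is a GNU grep feature and `data.csv` holds the rows from the question):

```shell
# Print every line that contains an empty field, with the consecutive commas
# (i.e. the empty fields) highlighted in red on a terminal.
# An empty first or last field would need "^," and ",$" added to the pattern.
grep --color ',,' data.csv
```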