Question

Asked: 2020-11-22 02:46:43 +0800 CST

Compare and selectively rewrite a csv file

I have a csv_1 that, simplified, has the following structure:

A,B,C
1,34,55
2,45,54
3,77,90
4,89,98

A second file, csv_2, which simplified has the following structure:

a,b
1,Y
4,Y

I'm trying to write a third file, csv_3, containing all rows of csv_1 except those whose value in column A appears in column a of csv_2. That is, csv_3 in this case would look like this:

A,B,C
2,45,54
3,77,90

I am trying this:

import csv
with open("csv_1.csv", 'r', encoding = 'utf8') as f1,\
        open("csv_2.csv", "r", encoding = 'utf8') as f2,\
            open("csv_3.csv", "w", encoding = 'utf8') as f3:
    reader1 = csv.DictReader(f1, dialect='unix', delimiter=",",
                            quotechar='"', quoting=csv.QUOTE_MINIMAL)
    reader2 = csv.DictReader(f2, dialect='unix', delimiter=",",
                            quotechar='"', quoting=csv.QUOTE_MINIMAL)
    writer = csv.DictWriter(f3, dialect='unix', delimiter=",", quotechar='"',
                            fieldnames=("A","B","C"),
                            quoting=csv.QUOTE_MINIMAL)
    writer.writerow()
    for row1 in reader1:
        if row1["A"] not in reader2:
            writer.writerow(row1)

1 Answer

FJSevilla · Answer 1 · 2020-11-22T03:17:16+08:00

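csv.DictReader returns an iterator, so you can't search it directly: a membership test on the reader consumes it and compares each whole row (a dict) with your string, which never matches. You have to read the column from the second file and store it in some data structure, preferably a set, in order to perform the search.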
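A minimal sketch of the difference, using the sample csv_2 data from the question held in memory with io.StringIO instead of a real file (the variable names here are only illustrative):

```python
import csv
import io

# Sample data from the question, held in memory instead of csv_2.csv.
csv_2 = io.StringIO("a,b\n1,Y\n4,Y\n")
reader2 = csv.DictReader(csv_2)

# `in` on an iterator walks through it, comparing each row (a dict) with "1".
# No dict equals a string, so the test is False and the reader is exhausted.
found_in_reader = "1" in reader2
rows_left = list(reader2)

# Collecting the column into a set first gives a reusable, fast lookup.
csv_2.seek(0)
keys = {row["a"] for row in csv.DictReader(csv_2)}
found_in_set = "1" in keys

print(found_in_reader, rows_left, found_in_set)
```

The first test fails even though the key is present, and it leaves nothing to read for any later test, which is why the loop in the question never filters anything.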

To write the header with csv.DictWriter you must use the writeheader method; calling writerow() with no argument raises a TypeError.

The code should be something like this:

import csv


# newline="" lets the csv module control line endings itself.
with open("csv_1.csv", "r", encoding="utf8", newline="") as f1,\
        open("csv_2.csv", "r", encoding="utf8", newline="") as f2,\
            open("csv_3.csv", "w", encoding="utf8", newline="") as f3:

    reader1 = csv.DictReader(f1, dialect='unix', delimiter=",",
                             quotechar='"', quoting=csv.QUOTE_MINIMAL)
    reader2 = csv.DictReader(f2, dialect='unix', delimiter=",",
                             quotechar='"', quoting=csv.QUOTE_MINIMAL)
    writer = csv.DictWriter(f3, dialect='unix', delimiter=",", quotechar='"',
                            fieldnames=reader1.fieldnames, quoting=csv.QUOTE_MINIMAL)

    # Collect the keys to exclude in a set for fast membership tests.
    csv2_A_col = {row["a"] for row in reader2}

    writer.writeheader()
    for row in reader1:
        # Keep only the rows whose key is not listed in csv_2.
        if row["A"] not in csv2_A_col:
            writer.writerow(row)

You can also use writerows and a generator expression:

csv2_A_col = {row["a"] for row in reader2}
writer.writeheader()
writer.writerows(row for row in reader1 if row["A"] not in csv2_A_col)
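To try the approach without touching the disk, here is a small self-contained sketch. The helper name filter_rows and its parameters are hypothetical, not part of the csv module, and the sample data from the question is fed in through io.StringIO:

```python
import csv
import io


def filter_rows(src, exclude, dst, src_key="A", exclude_key="a"):
    """Copy rows from src to dst, skipping those whose src_key value
    appears in the exclude_key column of exclude (all file-like objects)."""
    reader = csv.DictReader(src)
    # Read the exclusion column once into a set.
    excluded = {row[exclude_key] for row in csv.DictReader(exclude)}
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames,
                            dialect="unix", quoting=csv.QUOTE_MINIMAL)
    writer.writeheader()
    writer.writerows(row for row in reader if row[src_key] not in excluded)


# Sample data from the question.
csv_1 = io.StringIO("A,B,C\n1,34,55\n2,45,54\n3,77,90\n4,89,98\n")
csv_2 = io.StringIO("a,b\n1,Y\n4,Y\n")
csv_3 = io.StringIO()

filter_rows(csv_1, csv_2, csv_3)
result = csv_3.getvalue()
print(result)
```

The same function should work with real files opened as in the code above, since it only relies on the objects being file-like.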

If we only want to write some of the input file's columns to the output file, we can use the argument extrasaction with the value 'ignore'. For the example above, if we just want the columns A and C from csv_1, just do:

writer = csv.DictWriter(f3, extrasaction='ignore', dialect='unix', delimiter=",",
                        quotechar='"', fieldnames=("A","C"), quoting=csv.QUOTE_MINIMAL)
csv2_A_col = {row["a"] for row in reader2}
writer.writeheader()
writer.writerows(row for row in reader1 if row["A"] not in csv2_A_col)

This gives:

A,C
2,54
3,90
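For reference, a short sketch of what extrasaction changes, using a single hand-written row rather than the real files. By default (extrasaction='raise') DictWriter rejects a row containing keys that are not in fieldnames:

```python
import csv
import io

row = {"A": "2", "B": "45", "C": "54"}

# Default extrasaction="raise": the extra key "B" is rejected.
strict = csv.DictWriter(io.StringIO(), fieldnames=("A", "C"))
try:
    strict.writerow(row)
    raised = False
except ValueError:
    raised = True

# extrasaction="ignore": keys outside fieldnames are silently dropped.
out = io.StringIO()
lenient = csv.DictWriter(out, fieldnames=("A", "C"), extrasaction="ignore",
                         lineterminator="\n")
lenient.writerow(row)
written = out.getvalue()
print(raised, repr(written))
```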
