What is a promise in Javascript?

Question

Asked: 2022-10-18 17:42:07 +0800 CST 2022-10-18 17:42:07 +0800 CST 2022-10-18 17:42:07 +0800 CST

How to count and separate values of a dataframe?

772

I have the following data.framegene that has more than 100 thousand values and they are found according to the total column between values of 221 and 213 (that is, there are values for: 213, 214, 215, 216, 217, 218, 219, 220 and 221)

            mix total
1      A2M-ACTB   221
2     A2M-ACTG1   221
3     A2M-ANXA1   221
4       A2M-APP   221
5       A2M-B2M   221
6      A2M-CD24   221
7      A2M-CD74   221
8    A2M-COL1A2   221
9    A2M-COL3A1   221
10      A2M-DSP   221
11   A2M-EEF1A1   221
12     A2M-ENO1   221
13      A2M-FN1   221
14    A2M-GAPDH   221
15    A2M-HLA-B   221
16 A2M-HSP90AB1   221
17      A2M-MGP   221
18   A2M-RPL13A   221
19     A2M-RPS6   221
20   A2M-TM4SF1   221

And what I would like to do first is to count the number of rows of genes that have 221, then 220, and so on up to 213. Then I would like to separate each count into individual objects.

3 Answers

Voted

Sicabí · Answer 1 · 2022-10-18T22:15:59+08:00

The function tapply()allows you to apply a function to the values of a variable according to the categories of another variable. In this case, the function we can apply to count the categories in totalis length().

# datos de ejemplo
nrows <- ceiling(seq(100,250, length = 14))
genes <- data.frame(
mix = replicate(sum(nrows), expr =  paste0(sample(x = letters,size = 7, 
                                                  replace = T), collapse = "")),
total = rep(c(212, 213, 214, 215, 216, 217, 218, 219, 220,221,222,234,1000,10), 
    times = nrows))
genes

# conteo
conteo <- tapply(X = genes$mix ,INDEX = genes$total , FUN = length)

conteo[as.character(c(213:221))]
213 214 215 216 217 218 219 220 221 
112 124 135 147 158 170 181 193 204

R18 · Answer 2 · 2022-10-19T04:22:21+08:00

Hello!

I give the data set as an example irisand then I indicate how it would be done with your data set.

1) With the data setiris

# Cargar los datos
  data("iris")
  
# Separar por nombre de "Species"
  data.sep <- split(iris, iris$Species)
  
# Número de casos para cada variable
  lapply(data.sep, nrow)
  table(iris$Species)

2) With your data (being datayour name)

# 2.1) Separar por nombre de "total"
  data.sep <- split(data, data$Stotal)
  
# 2.2) Número de casos para cada variable
  lapply(data.sep, nrow)

# 2.2.b) Número de casos para cada variable (en tabla)
  table(data$total)

In case you are a new user, I explain a little what each step does:

Step 2.1) generates a "list" type object with as many elements as different values for the variabletotal
Step 2.2) calculates for each element of the previous list, through the function nrow()the number of rows of that sub-matrix, that is, the number of genes that have the same value for the variabletotal
Step 2.2.b) performs the same as step 2.2 but generates a table indicating the number of cases for each of the different values of the variable total.

Patricio Moracho · Answer 3 · 2022-10-19T04:47:08+08:00

If you dare to enter the mode tidyto work with the data, you will see that it is quite simple to elaborate what you ask for. Assuming an example similar to yours:

df <- data.frame(mix = runif(100), 
                 total = sample(200:225, 100, replace = TRUE))

The way to work is to apply step by step the transformations of the data that are necessary, in this case

Filter only those totalfrom a previously defined set
group bytotal
Summarize by counting the number of rows for each group

library(tidyverse)

df %>% 
  filter(total %in% c(213, 214, 215, 216, 217, 218, 219, 220,221)) %>% 
  group_by(total) %>% 
  summarise(cantidad=n())

# A tibble: 9 x 2
  total cantidad
  <int>    <int>
1   213        4
2   214        3
3   215        1
4   216        3
5   217        7
6   218        1
7   219        6
8   220        2
9   221        6

As for separating each count into individual objects , I imagine you are talking about transforming each row into an element of a list, which you can do by applying the verb group_split():

df %>% 
  filter(total %in% c(213, 214, 215, 216, 217, 218, 219, 220,221)) %>% 
  group_by(total) %>% 
  summarise(cantidad=n()) %>% 
  group_split(total)

How to count and separate values of a dataframe?

HTML button that sends you to another page

Why do I get the error "Call to undefined function mysql_connect()"?

How to create an HTML button that works as a link?

How to separate a String in Java. How to use split()

Filter by dates in sql server

How to limit the number of decimal places in a double?

For each in JavaScript?

Position footer ALWAYS glued to the footer

Definitive Guide to Type Conversion in Java

How to properly compare Strings (and objects) in Java?

How to count and separate values ​​of a dataframe?

3 Answers

How to count and separate values of a dataframe?