R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (2024)

This tutorial shows how to compute quantiles in the R programming language.

The article is mainly based on the quantile() R function. So let’s have a look at the basic R syntax and the definition of the quantile function first:

Basic R Syntax of quantile():

quantile(x)

Definition of quantile():

The quantile function computes the sample quantiles of a numeric input vector.

In the following R tutorial, I’ll explain in six examples how to use the quantile function to compute metrics such as quartiles, quintiles, deciles, or percentiles.

Let’s dive in!

Example 1: Basic Application of quantile() in R

In the first example, I’ll illustrate how to use the quantile function in its simplest way. Let’s create an exemplifying numeric vector first:

set.seed(15051) # Set seed for reproducibility x <- round(runif(1000, 0, 100)) # Create uniformly distributed datax # Print data to RStudio console# 73 44 4 2 3 78 15 38 59 70 80...

Our example vector contains 1,000 elements between the range of 1 and 100.

Now, we can apply the quantile R function to this vector as follows:

quantile(x) # Apply quantile function# 0% 25% 50% 75% 100% # 0 23 50 75 100

As you can see based on the RStudio console output, the quantile function returns the cutpoints (i.e. 0%, 25%, 50%, 75%, and 100%) as well as the corresponding quantiles.

Note: By default, the quantile function is returning the quartile (i.e. five cutpoints). Later on, I’ll show you how to get other metrics as well.

However, let’s first have a look at a common problem when the quantile function is applied…

Example 2: Handling NA Values with the quantile Function

In this example, you’ll learn how to deal with missing data (i.e. NA values) in the input vector. Let’s first insert an NA value to our example data:

x_NA <- c(x, NA) # Create example data with NA

Now, if we apply the quantile function to this vector, the quantile function returns an error message:

quantile(x_NA) # Apply quantile function to NA vector# Error in quantile.default(x_NA)

Fortunately, we can easily fix this error by specifying na.rm = TRUE within the quantile command:

quantile(x_NA, na.rm = TRUE) # Use na.rm argument# 0% 25% 50% 75% 100% # 0 23 50 75 100

Same output as in Example 1 – Perfect.

Example 3: Extract Quantile Values Only

AS you have seen based on the previous examples, the quantile function returns the cutpoints AND the corresponding values to the RStudio console. In some cases, however, we might prefer to keep only the quantile values.

In this case, we can simply apply the unname function to the output of the quantile function. Have a look at the following R code:

unname(quantile(x)) # Get only the quantile values# 0 23 50 75 100

Based on this R code, we only get the quantile values.

Example 4: Quantile by Group

In this example I’ll show you how to calculate the quantiles of certain subgroups. For the example, I’m going to use the Iris data matrix. Let’s load the data to R:

data(iris) # Load Iris datahead(iris) # Head of Iris data

R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (1)

Table 1: The Iris Data Frame.

The Iris data set contains several numeric variables and the grouping variable Species.

We can now produce a data matrix of quantiles of the first column grouped by the Species column with the following R syntax:

do.call("rbind", tapply(iris$Sepal.Length, # Specify numeric column iris$Species, # Specify group variable quantile))# 0% 25% 50% 75% 100%# setosa 4.3 4.800 5.0 5.2 5.8# versicolor 4.9 5.600 5.9 6.3 7.0# virginica 4.9 6.225 6.5 6.9 7.9

Note that it would also be possible to calculate quantiles by group based on functions of the tidyverse. This tutorial demonstrates how to calculate quantiles by group using the dplyr package.

Example 5: Quartiles, Quintiles, Deciles, Percentiles & Many More

As I told you before, the quantile function returns the quartile of the input vector by default. However, we can use the probs argument to get basically any quantile metric that we want.

With the following R codes, we can calculate the median…

quantile(x, probs = 0.5) # Median# 50% # 50

…tertiles…

quantile(x, probs = seq(0, 1, 1/3)) # Tertiles# 0% 33.33333% 66.66667% 100% # 0 34 68 100

…quartiles (as it would also be computed by default)…

quantile(x, probs = seq(0, 1, 1/4)) # Quartiles# 0% 25% 50% 75% 100% # 0 23 50 75 100

…quintiles…

quantile(x, probs = seq(0, 1, 1/5)) # Quintiles# 0% 20% 40% 60% 80% 100% # 0 18 40 61 80 100

…sextiles…

quantile(x, probs = seq(0, 1, 1/6)) # Sextiles# 0% 16.66667% 33.33333% 50% 66.66667% 83.33333% 100% # 0 15 34 50 68 83 100

…septiles…

quantile(x, probs = seq(0, 1, 1/7)) # Septiles# 0% 14.28571% 28.57143% 42.85714% 57.14286% 71.42857% 85.71429% 100% # 0 13 27 43 58 72 86 100

…octiles…

quantile(x, probs = seq(0, 1, 1/8)) # Octiles# 0% 12.5% 25% 37.5% 50% 62.5% 75% 87.5% 100% # 0.000 11.875 23.000 38.000 50.000 63.000 75.000 88.000 100.000

…deciles…

quantile(x, probs = seq(0, 1, 1/10)) # Deciles# 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% # 0 9 18 29 40 50 61 71 80 90 100

…duo-deciles…

quantile(x, probs = seq(0, 1, 1/12)) # Duo-deciles or dodeciles# 0% 8.333333% 16.66667% 25% 33.33333% 41.66667% 50% 58.33333% 66.66667% 75% 83.33333% 91.66667% 100% # 0 8 15 23 34 42 50 59 68 75 83 92 100

…hexadeciles…

quantile(x, probs = seq(0, 1, 1/16)) # Hexadeciles# 0% 6.25% 12.5% 18.75% 25% 31.25% 37.5% 43.75% 50% 56.25% 62.5% 68.75% 75% 81.25% 87.5% 93.75% 100% # 0.0000 7.0000 11.8750 17.0000 23.0000 30.1875 38.0000 44.0625 50.0000 57.0000 63.0000 70.0000 75.0000 81.0000 88.0000 94.0000 100.0000

…ventiles…

quantile(x, probs = seq(0, 1, 1/20)) # Ventiles, vigintiles, or demi-deciles# 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55% 60% 65% 70% 75% 80% 85% 90% 95% 100% # 0.00 5.00 9.00 13.00 18.00 23.00 29.00 36.00 40.00 45.55 50.00 56.00 61.00 66.00 71.00 75.00 80.00 85.00 90.00 95.00 100.00

…percentiles…

quantile(x, probs = seq(0, 1, 1/100)) # Percentiles# 0% 1% 2% 3% 4% 5% 6% 7% 8% 9% 10% 11% 12% 13% ...# 0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 8.91 9.00 10.00 11.00 12.00 ...

…or permilles:

quantile(x, probs = seq(0, 1, 1/1000)) # Permilles or milliles# 0.0% 0.1% 0.2% 0.3% 0.4% 0.5% 0.6% 0.7% 0.8% 0.9% 1.0% 1.1% 1.2% 1.3% ...# 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.993 1.000 1.000 1.000 1.000 1.000 1.987 ...

Example 6: How to Visualize Quantiles

Quantiles are often used for data visualization, most of the time in so called Quantile-Quantile plots.

Quantile-Quantile plots can be created in R based on the qqplot function. Let’s do this in practice!

First, we need to create a second vector:

y <- x + rnorm(1000, 0, 30) # Create y-data

Now, we can print a qqplot of our two example vectors with the qqplot function as follows:

qqplot(x, y) # Quantile-Quantile plot of x & y

R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (2)

Figure 1: Basic Quantile-Quantile Plot in R.

Video, Further Resources & Summary

Below, you can find a video on the Statistics Globe YouTube channel where I describe the steps of this tutorial in expanded detail:

Quantiles can be a very useful weapon in statistical research. A topic we haven’t talked about yet is the commonly used quantile regression. If you want to learn more about quantile regressions, you can have a look at the following YouTube video of Anders Munk-Nielsen:

Furthermore, you may have a look at the other R tutorials on Statistics Globe:

  • Quantile-Quantile Plot in R
  • Compute Interquartile Range (IQR) in R
  • The Empirical Cumulative Distribution Function (ecdf R Function)
  • The do.call R Function
  • R Functions List (+ Examples)
  • The R Programming Language

At this point, I hope you know how to deal with the quantile function in the R programming language. However, if you have any questions don’t hesitate to let me know in the comments section below.

17 Comments. Leave new

  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (3)

    kumar G

    April 10, 2020 3:09 pm

    do.call(“rbind”,
    tapply(iris$Sepal.Length, # Specify numeric column
    iris$Species, # Specify group variable
    quantile))
    in this scrip how to set the row.names or how to know which row belongs to which category?

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (4)

      Joachim

      April 11, 2020 5:22 pm

      Hi Kumar,

      Thank you for your question. Could you elaborate your question in some more detail? I’m not sure if I understand the question correctly.

      Regards,

      Joachim

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (5)

    Chris

    July 26, 2020 6:34 pm

    Hi, Thanks for the post! I was wondering how would you do if you want a first group which contained between 0-5% and then separate in tertiles (5-100%).

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (6)

      Joachim

      July 27, 2020 5:24 am

      Hey Chris,

      Thanks for the comment! If I understand your question correctly, then this is what is shown in Example 5 – ventiles. Does this solve your problem?

      Regards,

      Joachim

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (7)

    Solène

    November 9, 2021 2:51 am

    Hi Joachim, thank you for your post. It’s very useful, but i have some questions about type argument which you haven’t mention in your post.

    # method for default
    quantile(x, probs = seq(0, 1, 0.25), na.rm = FALSE,
    names = TRUE, type = 7, …)
    type
    an integer between 1 and 9 selecting one of the nine quantile algorithms detailed below to be used.

    i have just some informations, but i can’t understand it very well. Could you please explain to me if you know about the usage of this argument ? Thanks in advance.

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (8)

      Joachim

      November 9, 2021 7:04 am

      Hey Solène,

      First of all, thank you for the kind words! Glad you like the article!

      Regarding your question: The type argument allows you to specify different algorithms for the computation of the quantiles. The different types and formulas are described in the help documentation of the quantile function.

      You may open the help documentation using the code ?quantile, and then you will find a detailed description of the algorithms under the section “Type”.

      Regards,
      Joachim

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (9)

    Nandini

    September 3, 2022 5:02 am

    I am looking at the variable ‘thorax’ from the data ‘fruitflies’ in the faraway package. I ran the R function ‘quantile’ on ‘thorax.
    > data(fruitfly)
    > dim(fruitfly)
    [1] 124 3
    > head(fruitfly)
    thorax longevity activity
    1 0.68 37 many
    2 0.68 49 many
    3 0.72 46 many
    4 0.72 63 many
    5 0.76 39 many
    6 0.76 46 many
    > Deciles Deciles
    10% 20% 30% 40% 50% 60% 70% 80% 90%
    0.72 0.76 0.80 0.82 0.84 0.84 0.88 0.88 0.92

    Identify all the four quintiles:

    I quintile =
    II quintile =
    III quintile =
    IV quintile =

    Can you explain this and let me know the answer. I am a beginner. please help me with this.

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (10)

      Joachim

      September 5, 2022 11:08 am

      Hey Nandini,

      Quintiles can be calculated as shown in the following code:

      quantile(x, probs = seq(0, 1, 1/5))

      You would have to replace x by the data you want to calculate the quintiles for.

      Regards,
      Joachim

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (11)

    Bethany

    October 29, 2022 10:32 pm

    Hi, This is super helpful. I have been trying to obtain 98th percentile from a data frame with grouping. This basically combines example 4 with the first example of 5. I tried this:

    do.call(“rbind”,
    tapply(MT_2.5_2001$Arithmetic.Mean,
    MT_2.5_2001$County.Name,
    quantile( probs=0.98))

    **No default data, so then tried this:

    do.call(“rbind”,
    tapply(MT_2.5_2001$Arithmetic.Mean,
    MT_2.5_2001$County.Name,
    quantile(MT_2.5_2001$Arithmetic.Mean, probs=0.98)))

    With this error code: Error in match.fun(FUN) :
    ‘quantile(MT_2.5_2001$Arithmetic.Mean, probs = 0.98)’ is not a function, character or symbol

    do.call(“rbind”,
    tapply(MT_2.5_2001$Arithmetic.Mean,
    MT_2.5_2001$County.Name,
    quantile(MT_2.5_2001$Arithmetic.Mean, probs=0.98)))

    error in match.fun(FUN) :
    ‘quantile(MT_2.5_2001$Arithmetic.Mean, probs = 0.98)’ is not a function, character or symbol

    and tried using dplyr, which spits out one number, not grouped data

    MT_2.5_2017.98%
    group_by(County.Name) %>%
    summarize(quant98 = ~quantile(Arithmetic.Mean, probs= 0.98))
    MT_2.5_2017.98

    Any suggestions?

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (12)

      Joachim

      November 14, 2022 12:45 pm

      Hi Bethany,

      Thank you so much for the kind words, glad you find my tutorials helpful!

      I apologize for the delayed reply. I was on a long vacation, so unfortunately I wasn’t able to get back to you earlier. Do you still need help with your syntax?

      Regards,
      Joachim

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (13)

    Bethany

    November 14, 2022 3:04 pm

    Hi Joachim, No problem. I either figured it out or found a work around. I am an R novice so doing a lot of learning by internet blogs like yours. Thank you!

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (14)

      Joachim

      November 15, 2022 8:24 am

      Hi Bethany,

      It’s great to hear that you found a solution, and thanks a lot for the very kind words regarding my blog! 🙂

      Regards,
      Joachim

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (15)

    samin

    November 30, 2022 9:21 am

    Hi,

    I have a continuous variable in my dataset with such a distribution:

    summary(emissions$NMVOC_gram)
    Min. 1st Qu. Median Mean 3rd Qu. Max.
    0 256 547 15802 1074 50818630
    how can i categorize this variable to unequal levels of extremely high to extremely low, low, high, and medium in R or excel?

    thank you for the help

    I tried the cut function in r but the result was not what I expected, actually, I do not know how I should define the breaks, in my data the 3rd Qu. is lower than the Mean.

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (16)

      Cansu (Statistics Globe)

      December 14, 2022 10:43 am

      Hi Samin,

      Sorry for the late response. Could you find a solution? Here is what I found by a small search. You can split your values into quantiles via the split_quantiles() function of the fabricatr package. Then you can rename your values. See below:

      #install.packages("fabricatr")library("fabricatr")#Sample Datadata<-data.frame(values=rnorm(n = 100), id=1:100)head(data)# values id# 1 -0.29894200 1# 2 -0.17558019 2# 3 1.17584919 3# 4 -0.01174238 4# 5 1.70457408 5# 6 0.93288038 6#Quantile Bucketsdata$quantiles<-split_quantile(x = data$values, type = 4)#Type of classclass(data$quantiles)#[1] "factor"#Conversion to char for the sake of the rest of the computationdata$quantiles<-as.character(data$quantiles)#Renaming the valuesdata$quantiles[data$quantiles=="1"]<-"Extremely Low" #Similar code lines below yet this time with "2", "3", "4" and "Low"....head(data)# values id quantiles# 1 0.3154646 1 3# 2 0.6964094 2 3# 3 -1.4426406 3 Extremely Low# 4 0.7007485 4 3# 5 -0.6343276 5 Extremely Low# 6 1.3871343 6 4

      Regards,
      Cansu

      Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (17)

    samin

    December 22, 2022 11:19 am

    thank you very much for your help,

    Reply
  • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (18)

    Rokesh

    March 24, 2023 2:18 pm

    how to get a particular quantile value like q1 or q3? help is much appreciated.

    Reply
    • R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (19)

      Cansu (Statistics Globe)

      March 27, 2023 9:01 am

      Hello Rokesh,

      You can use the same code setting shown in the tutorial as follows:

      quantile(x, probs = c(0.25, 0.75))

      Regards,
      Cansu

      Reply

Leave a Reply

I’m Joachim Schork. On this website, I provide statistics tutorials as well as code in Python and R programming.

Statistics Globe Newsletter

Related Tutorials

How to Calculate Correlation Coefficients in R (5 Examples) | cor Function

R quantile Function (6 Examples) | NA, Group, Quartiles, Quintiles, Deciles (2024)

FAQs

What is quantile () in R? ›

In statistics, quantiles are values that divide a ranked dataset into equal groups. The quantile() function in R can be used to calculate sample quantiles of a dataset. This function uses the following basic syntax: quantile(x, probs = seq(0, 1, 0.25), na.rm = FALSE)

What are 25% and 75% quantiles? ›

Centiles/percentiles are descriptions of quantiles relative to 100; so the 75th percentile (upper quartile) is 75% or three quarters of the way up an ascending list of sorted values of a sample. The 25th percentile (lower quartile) is one quarter of the way up this rank order.

What are 5% and 95% quantiles? ›

The 95th percentile is the value where 95% of all measurements are under it, and 5% of measurements are over it. The chart above shows the page load histogram with all of the measurements below the 95th percentile coloured in blue.

What is the 90% quantile? ›

The 90th percentile indicates the point where 90% percent of the data have values less than this number. More generally, the pth percentile is the number n for which p% of the data is less than n.

What does quantile () do? ›

In probability and statistics, the quantile function outputs the value of a random variable such that its probability is less than or equal to an input probability value.

How do you solve for quantile? ›

How to find quantiles
  1. Calculate n * (3 / 5).
  2. If n * (3 / 5) is an integer, then the third 5-quantile is the mean of the numbers at positions n * (3 / 5) and n * (3 / 5) + 1.
  3. If n * (3 / 5) is not an integer, then round it up. The number at this position is the third 5-quantile.
May 20, 2022

What are the 3 special quantiles? ›

Common quantiles have special names, such as quartiles (four groups), deciles (ten groups), and percentiles (100 groups).

What is the 50% quantile? ›

50th Percentile - Also known as the Median. The median cuts the data set in half. Half of the answers lie below the median and half lie above the median. 75th Percentile - Also known as the third, or upper, quartile.

What are the 4 quantiles? ›

Quartiles organize data into three points—a lower quartile, median, and upper quartile—to form four dataset groups. Along with the minimum and maximum values of the data set, the quartiles divide a set of observations into four sections, each representing 25% of the observations.

What is the difference between quantile and percentile in R? ›

Percentiles are given as percent values, values such as 95%, 40%, or 27%. Quantiles are given as decimal values, values such as 0.95, 0.4, and 0.27. The 0.95 quantile point is exactly the same as the 95th percentile point. R does not work with percentiles, rather R works with quantiles.

What is the difference between quartiles and quantiles? ›

Definition: Quartiles and quantiles

Quartiles are a set of three values that divide ordered data sets into four equal parts. Quantiles are units of equal, adjacent quantities in a distribution. Quartiles, percentiles, and deciles are examples of quantiles used in research as statistical quantities.

What is the difference between percentile and centile? ›

A percentile (or a centile) is a measure used in statistics indicating the value below which a given percentage of observations in a group of observations fall. For example, the 20th percentile is the value (or score) below which 20% of the observations may be found.

Is 100 percentile possible? ›

Percentile rank is a number between 0 and 100 indicating the percent of cases falling at or below that score. There is no 0 percentile rank - the lowest score is at the first percentile. There is no 100th percentile - the highest score is at the 99th percentile.

How to convert percentile into percentage? ›

Identify the percentile rank: This is the position you or a data point holds within the total. Calculate the proportion: Divide the percentile rank by the total number of data points or subjects. This gives you a proportion. Convert the proportion to a percentage: Multiply the proportion by 100 to get a percentage.

What is the difference between inclusive and exclusive percentile rank? ›

Inclusive evaluate all values in the data from the smallest value to the largest value. Exclusive evaluates all values in the data between the smallest value and the largest value. The smallest value is calculated as 1/n and the largest value is calculated as 1-1/n where n is the number of listed items.

What is the purpose of quantile regression? ›

Quantile regression is widely seen as an ideal tool to understand complex predictor-response relations. Its biggest promise rests in its ability to quantify whether and how predictor effects vary across response quantile levels.

Is quantile the same as percentile in R? ›

The 0.95 quantile point is exactly the same as the 95th percentile point. R does not work with percentiles, rather R works with quantiles. The R command for this is quantile() where we need to give that function the variable holding the data we are using and we need to give the function one or more decimal values.

Why do we use quantile? ›

Summary. Sample quantiles give a robust description of the distribution of a variable. The sample median can be used to measure the centre of a distribution. The quartiles and interquartile range give a measure of the spread of a distribution.

What is a quantile regression R? ›

Quantile regression is an evolving body of statistical methods for estimating and drawing inferences about conditional quantile functions. An implementation of these methods in the R language is available in the package quantreg.

References

Top Articles
Latest Posts
Article information

Author: Rubie Ullrich

Last Updated:

Views: 5783

Rating: 4.1 / 5 (72 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Rubie Ullrich

Birthday: 1998-02-02

Address: 743 Stoltenberg Center, Genovevaville, NJ 59925-3119

Phone: +2202978377583

Job: Administration Engineer

Hobby: Surfing, Sailing, Listening to music, Web surfing, Kitesurfing, Geocaching, Backpacking

Introduction: My name is Rubie Ullrich, I am a enthusiastic, perfect, tender, vivacious, talented, famous, delightful person who loves writing and wants to share my knowledge and understanding with you.