blob: 3848865849e74c599f1269515878f6d3e6abe308 [file] [log] [blame]
Hao Zhu6a076462017-03-01 12:59:01 -05001---
2title: "Create Awesome LaTeX Table with knitr::kable and kableExtra"
3author: "Hao Zhu"
4date: "`r Sys.Date()`"
5output:
6 pdf_document:
7 toc: true
8 toc_depth: 2
Hao Zhu014d6212017-08-07 04:20:23 +08009header-includes:
10 - \usepackage{booktabs}
11 - \usepackage{longtable}
12 - \usepackage{array}
13 - \usepackage{multirow}
14 - \usepackage[table]{xcolor}
15 - \usepackage{wrapfig}
Hao Zhu245931c2017-09-01 22:43:56 -040016 - \usepackage{float}
Hao Zhu65216052017-09-04 13:01:29 -040017 - \usepackage{colortbl}
18 - \usepackage{pdflscape}
19 - \usepackage{tabu}
20 - \usepackage{threeparttable}
Hao Zhu014d6212017-08-07 04:20:23 +080021vignette: >
22 %\VignetteIndexEntry{Create Awesome PDF Table with knitr::kable and kableExtra}
23 %\VignetteEngine{knitr::rmarkdown}
24 %\VignetteEncoding{UTF-8}
Hao Zhu6a076462017-03-01 12:59:01 -050025---
26
Hao Zhue1be9602017-08-17 15:44:31 -040027> Please see the package [documentation site](http://haozhu233.github.io/kableExtra) for how to use this package in HTML and more.
Hao Zhu014d6212017-08-07 04:20:23 +080028
Hao Zhu6a076462017-03-01 12:59:01 -050029# Overview
30The goal of `kableExtra` is to help you build common complex tables and manipulate table styles. It imports the pipe `%>%` symbol from `magrittr` and verbalize all the functions, so basically you can add "layers" to a kable output in a way that is similar with `ggplot2` and `plotly`.
31
Hao Zhu0a0e8332017-08-03 13:21:29 -040032To learn how to generate complex tables in LaTeX, please visit [http://haozhu233.github.io/kableExtra/awesome_table_in_html.html](http://haozhu233.github.io/kableExtra/awesome_table_in_html.html).
33
Hao Zhu6a076462017-03-01 12:59:01 -050034# Installation
Hao Zhu6a076462017-03-01 12:59:01 -050035```r
Hao Zhu74eb6ad2017-03-04 09:32:37 -050036install.packages("kableExtra")
37
Hao Zhu6a076462017-03-01 12:59:01 -050038# For dev version
Hao Zhuf9aa4c42017-05-22 15:53:35 -040039# install.packages("devtools")
Hao Zhu6a076462017-03-01 12:59:01 -050040devtools::install_github("haozhu233/kableExtra")
41```
Hao Zhuf9aa4c42017-05-22 15:53:35 -040042
Hao Zhu6a076462017-03-01 12:59:01 -050043# Getting Started
44Here we are using the first few columns and rows from dataset `mtcars`
45```{r}
46library(knitr)
47library(kableExtra)
48dt <- mtcars[1:5, 1:6]
49```
50
Hao Zhu65216052017-09-04 13:01:29 -040051When you are using `kable()`, if you don't specify `format`, by default it will generate a markdown table and let pandoc handle the conversion from markdown to HTML/PDF. This is the most favorable approach to render most simple tables as it is format independent. If you switch from HTML to pdf, you basically don't need to change anything in your code. However, markdown doesn't support complex table. For example, if you want to have a double-row header table, markdown just cannot provide you the functionality you need. As a result, when you have such a need, you should **define `format` in `kable()`** as either "html" or "latex". *You can also define a global option at the beginning using `options(knitr.table.format = "latex")` so you don't repeat the step everytime.* **In this tutorial, I'll still put `format="latex"` in the function in case users just want to quickly replicate the results.**
Hao Zhu6a076462017-03-01 12:59:01 -050052
53```{r}
54options(knitr.table.format = "latex")
55## If you don't define format here, you'll need put `format = "latex"`
56## in every kable function.
57```
58
Hao Zhuc761ee52017-09-04 13:04:35 -040059## LaTeX packages used in this package
Hao Zhu65216052017-09-04 13:01:29 -040060If you are using a recent version of rmarkdown, you are recommended to load this package entirely via `library(kableExtra)` or `require(kableExtra)` because this package will load all necessary LaTeX packages, such as `booktabs` or `multirow`, for you automatically. Note that, if you are calling functions from `kableExtra` via `kableExtra::kable_styling()` or if you put `library(kableExtra)` in a separate R file that is **sourced** by the rmarkdown document, these packages won't be loaded. Furthermore, you can suppress this auto-loading behavior by setting a global option `kableExtra.latex.load_packages` to be `FALSE` before you load `kableExtra`.
61
Hao Zhub350b9f2017-09-04 13:07:57 -040062```{r, eval = FALSE}
Hao Zhu2e318eb2017-09-04 13:18:42 -040063# Not evaluated. Ilustration purpose
Hao Zhu65216052017-09-04 13:01:29 -040064options(kableExtra.latex.load_package = FALSE)
Hao Zhu2e318eb2017-09-04 13:18:42 -040065library(kableExtra)
Hao Zhu65216052017-09-04 13:01:29 -040066```
67
68If you are using R Sweave or some customized rmarkdown templates or you just want to load these LaTeX libraries by yourself, you can put the following meta data into the `yaml` section. If you are familar with LaTeX and you know what you are doing, feel free to remove unnecessary packages from the list.
69
Hao Zhub350b9f2017-09-04 13:07:57 -040070```{yaml}
Hao Zhu65216052017-09-04 13:01:29 -040071header-includes:
72 - \usepackage{booktabs}
73 - \usepackage{longtable}
74 - \usepackage{array}
75 - \usepackage{multirow}
76 - \usepackage[table]{xcolor}
77 - \usepackage{wrapfig}
78 - \usepackage{float}
79 - \usepackage{colortbl}
80 - \usepackage{pdflscape}
81 - \usepackage{tabu}
82 - \usepackage{threeparttable}
83```
84
Hao Zhu6a076462017-03-01 12:59:01 -050085## Plain LaTeX
86Plain LaTeX table looks relatively ugly in 2017.
87```{r}
88kable(dt)
89```
90
Hao Zhu0a0e8332017-08-03 13:21:29 -040091## LaTeX table with booktabs
Hao Zhu6a076462017-03-01 12:59:01 -050092Similar with Bootstrap in HTML, in LaTeX, you can also use a trick to make your table look prettier as well. The different part is that, this time you don't need to pipe kable outputs to another function. Instead, you should call `booktabs = T` directly in `kable()`
93```{r}
Hao Zhu65216052017-09-04 13:01:29 -040094kable(dt, format = "latex", booktabs = T)
Hao Zhu6a076462017-03-01 12:59:01 -050095```
96
97# Table Styles
98`kable_styling` in LaTeX uses the same syntax and structure as `kable_styling` in HTML. However, instead of `bootstrap_options`, you should specify `latex_options` instead.
99
Hao Zhu0a0e8332017-08-03 13:21:29 -0400100## LaTeX options
Hao Zhu6a076462017-03-01 12:59:01 -0500101Similar with `bootstap_options`, `latex_options` is also a charter vector with a bunch of options including `striped`, `hold_position` and `scale_down`.
102
103### Striped
104Even though in the LaTeX world, people usually call it `alternative row colors` but here I'm using its bootstrap name for consistency. Note that to make it happen, LaTeX package `xcolor` is required to be loaded. In an environment like rmarkdown::pdf_document (rmarkdown 1.4.0 +), `kable_styling` will load it automatically if `striped` is enabled. However, in other cases, you probably need to import that package by yourself.
105```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400106kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500107 kable_styling(latex_options = "striped")
108```
109
Hao Zhu0a0e8332017-08-03 13:21:29 -0400110### Hold position
Hao Zhu6a076462017-03-01 12:59:01 -0500111If you provide a table caption in `kable()`, it will put your LaTeX tabular in a `table` environment, unless you are using `longtable`. A `table` environment will automatically find the best place (it thinks) to put your table. However, in many cases, you do want your table to appear in a position you want it to be. In this case, you can use this `hold_position` options here.
112```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400113kable(dt, format = "latex", caption = "Demo table", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500114 kable_styling(latex_options = c("striped", "hold_position"))
115```
116
117### Scale down
Hao Zhuf9aa4c42017-05-22 15:53:35 -0400118When you have a wide table that will normally go out of the page and you want to scale down the table to fit the page, you can use the `scale_down` option here. Note that, if your table is too small, it will also scale up your table. It was named in this way only because scaling up isn't very useful in most cases.
Hao Zhu6a076462017-03-01 12:59:01 -0500119```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400120kable(cbind(dt, dt, dt), format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500121 kable_styling(latex_options = c("striped", "scale_down"))
122```
123```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400124kable(cbind(dt), format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500125 kable_styling(latex_options = c("striped", "scale_down"))
126```
127
Hao Zhu0a0e8332017-08-03 13:21:29 -0400128### Repeat header in longtable
129In `kableExtra` 0.3.0 or above, a new option `repeat_header` was introduced into `kable_styling`. It will add header rows to longtables spanning multiple pages. For table captions on following pages, it will append *"continued"* to the caption to differentiate. If you need texts other than *"(continued)"* (for example, other languages), you can specify it using `kable_styling(..., repeat_header_text = "xxx")`. If you want to complete replace the table caption instead of appending, you can specify it in the option `repeat_header_method`.
Hao Zhu6ff9d502017-06-13 17:13:03 -0400130```{r}
131long_dt <- rbind(mtcars, mtcars)
132
Hao Zhu65216052017-09-04 13:01:29 -0400133kable(long_dt, format = "latex", longtable = T, booktabs = T, caption = "Longtable") %>%
Hao Zhu6ff9d502017-06-13 17:13:03 -0400134 add_header_above(c(" ", "Group 1" = 5, "Group 2" = 6)) %>%
135 kable_styling(latex_options = c("repeat_header"))
136```
137
Hao Zhu6a076462017-03-01 12:59:01 -0500138
Hao Zhu0a0e8332017-08-03 13:21:29 -0400139## Full width?
Hao Zhuf9aa4c42017-05-22 15:53:35 -0400140If you have a small table and you want it to spread wide on the page, you can try the `full_width` option. Unlike `scale_down`, it won't change your font size. Note that, if you use `full_width` in LaTeX, you will loss your in-cell text alignment settings and everything will be left-aligned.
Hao Zhu6a076462017-03-01 12:59:01 -0500141```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400142kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500143 kable_styling(full_width = T)
144```
145
146## Position
147Table Position only matters when the table doesn't have `full_width`. You can choose to align the table to `center` or `left` side of the page. The default value of position is `center`.
148
149Note that even though you can select to `right` align your table but the table will actually be centered. Somehow it is very difficult to right align a table in LaTeX (since it's not very useful in the real world?). If you know how to do it, please send out an issue or PR and let me know.
150```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400151kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500152 kable_styling(position = "center")
153```
154
155Becides these three common options, you can also wrap text around the table using the `float-left` or `float-right` options. Note that, like `striped`, this feature will load another non-default LaTeX package `wrapfig` which requires rmarkdown 1.4.0 +. If you rmarkdown version < 1.4.0, you need to load the package through a customed LaTeX template file.
156```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400157kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500158 kable_styling(position = "float_right")
159```
160Lorem ipsum dolor sit amet, consectetur adipiscing elit. Cras sit amet mauris in ex ultricies elementum vel rutrum dolor. Phasellus tempor convallis dui, in hendrerit mauris placerat scelerisque. Maecenas a accumsan enim, a maximus velit. Pellentesque in risus eget est faucibus convallis nec at nulla. Phasellus nec lacinia justo. Morbi fermentum, orci id varius accumsan, nibh neque porttitor ipsum, consectetur luctus risus arcu ac ex. Aenean a luctus augue. Suspendisse et auctor nisl. Suspendisse cursus ultrices quam non vulputate. Phasellus et pharetra neque, vel feugiat erat. Sed feugiat elit at mauris commodo consequat. Sed congue lectus id mattis hendrerit. Mauris turpis nisl, congue eget velit sed, imperdiet convallis magna. Nam accumsan urna risus, non feugiat odio vehicula eget.
161
162## Font Size
163If one of your tables is huge and you want to use a smaller font size for that specific table, you can use the `font_size` option.
164```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400165kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500166 kable_styling(font_size = 7)
167```
168
Hao Zhu0a0e8332017-08-03 13:21:29 -0400169# Column / Row Specification
170## Column spec
171When you have a table with lots of explanatory texts, you may want to specified the column width for different column, since the auto adjust in HTML may not work in its best way while basic LaTeX table is really bad at handling text wrapping. Also, sometimes, you may want to highlight a column (e.g. a "Total" column) by making it bold. In these scenario, you can use `column_spec()`. You can find an example below.
172```{r}
173text_tbl <- data.frame(
174 Items = c("Item 1", "Item 2", "Item 3"),
175 Features = c(
176 "Lorem ipsum dolor sit amet, consectetur adipiscing elit. Proin vehicula tempor ex. Morbi malesuada sagittis turpis, at venenatis nisl luctus a. ",
177 "In eu urna at magna luctus rhoncus quis in nisl. Fusce in velit varius, posuere risus et, cursus augue. Duis eleifend aliquam ante, a aliquet ex tincidunt in. ",
178 "Vivamus venenatis egestas eros ut tempus. Vivamus id est nisi. Aliquam molestie erat et sollicitudin venenatis. In ac lacus at velit scelerisque mattis. "
179 )
180)
181
Hao Zhu65216052017-09-04 13:01:29 -0400182kable(text_tbl, format = "latex", booktabs = T) %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400183 kable_styling(full_width = F) %>%
Hao Zhua44e3752017-09-05 12:56:19 -0400184 column_spec(1, bold = T, color = "red") %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400185 column_spec(2, width = "30em")
186```
187
188## Row spec
189Similar with `column_spec`, you can define specifications for rows. Currently, you can either bold or italiciz an entire row. Note that, similar with other row-related functions in `kableExtra`, for the position of the target row, you don't need to count in header rows or the group labelling rows.
190
191```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400192kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400193 kable_styling("striped", full_width = F) %>%
Hao Zhua44e3752017-09-05 12:56:19 -0400194 column_spec(7, border_left = T) %>%
195 row_spec(5, bold = T, color = "white", background = "black")
Hao Zhu0a0e8332017-08-03 13:21:29 -0400196```
197
198# Grouped Columns / Rows
199## Add header rows to group columns
Hao Zhu6a076462017-03-01 12:59:01 -0500200Tables with multi-row headers can be very useful to demonstrate grouped data. To do that, you can pipe your kable object into `add_header_above()`. The header variable is supposed to be a named character with the names as new column names and values as column span. For your convenience, if column span equals to 1, you can ignore the `=1` part so the function below can be written as `add_header_above(c(" ", "Group 1" = 2, "Group 2" = 2, "Group 3" = 2)).
201```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400202kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500203 kable_styling() %>%
204 add_header_above(c(" " = 1, "Group 1" = 2, "Group 2" = 2, "Group 3" = 2))
205```
206
Hao Zhu916c3662017-06-21 15:55:05 -0400207In fact, if you want to add another row of header on top, please feel free to do so. Also, in kableExtra 0.3.0, you can specify `bold` & `italic` as you do in `row_spec()`.
Hao Zhu6a076462017-03-01 12:59:01 -0500208```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400209kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu00ba87c2017-08-01 12:42:58 -0400210 kable_styling(latex_options = "striped") %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500211 add_header_above(c(" ", "Group 1" = 2, "Group 2" = 2, "Group 3" = 2)) %>%
212 add_header_above(c(" ", "Group 4" = 4, "Group 5" = 2)) %>%
Hao Zhu916c3662017-06-21 15:55:05 -0400213 add_header_above(c(" ", "Group 6" = 6), bold = T, italic = T)
Hao Zhu6a076462017-03-01 12:59:01 -0500214```
215
Hao Zhu0a0e8332017-08-03 13:21:29 -0400216## Group rows via labeling
217Sometimes we want a few rows of the table being grouped together. They might be items under the same topic (e.g., animals in one species) or just different data groups for a categorical variable (e.g., age < 40, age > 40). With the new function `group_rows()` in `kableExtra`, this kind of task can be completed in one line. Please see the example below. Note that when you count for the start/end rows of the group, you don't need to count for the header rows nor other group label rows. You only need to think about the row numbers in the "original R dataframe".
218```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400219kable(mtcars[1:10, 1:6], format = "latex", caption = "Group Rows", booktabs = T) %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400220 kable_styling() %>%
221 group_rows("Group 1", 4, 7) %>%
222 group_rows("Group 2", 8, 10)
223```
224
225In case some users need it, you can define your own gapping spaces between the group labeling row and previous rows. The default value is `0.5em`.
226```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400227kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400228 group_rows("Group 1", 4, 5, latex_gap_space = "2em")
229```
230
231## Row indentation
232Unlike `group_rows()`, which will insert a labeling row, sometimes we want to list a few sub groups under a total one. In that case, `add_indent()` is probably more apporiate.
233For advanced users, you can even define your own css for the group labeling.
234```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400235kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400236 add_indent(c(1, 3, 5))
237```
238
239## Group rows via multi-row cell
240Function `group_rows` is great for showing simple structural information on rows but sometimes people may need to show structural information with multiple layers. When it happens, you may consider to use `collapse_rows` instead, which will put repeating cells in columns into multi-row cells.
241
242```{r}
243collapse_rows_dt <- data.frame(C1 = c(rep("a", 10), rep("b", 5)),
244 C2 = c(rep("c", 7), rep("d", 3), rep("c", 2), rep("d", 3)),
245 C3 = 1:15,
246 C4 = sample(c(0,1), 15, replace = TRUE))
Hao Zhu65216052017-09-04 13:01:29 -0400247kable(collapse_rows_dt, format = "latex", booktabs = T, align = "c") %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400248 column_spec(1, bold=T) %>%
249 collapse_rows(columns = 1:2)
250```
251
252```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400253kable(collapse_rows_dt, format = "latex", align = "c") %>%
Hao Zhu0a0e8332017-08-03 13:21:29 -0400254 column_spec(1, bold = T, width = "5em") %>%
255 collapse_rows(1:2)
256```
257
258# Table Footnote
259## Notation system
Hao Zhu6a076462017-03-01 12:59:01 -0500260You can also use `add_footnote()` function from this package. You will need to supply a character vector with each element as one footnote. You may select from `number`, `alphabet` and `symbol` for different types of notations. Example are listed below.
261
262### Alphabet
263```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400264kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500265 kable_styling() %>%
266 add_footnote(c("Footnote 1", "Have a good day."), notation = "alphabet")
267```
268
269### Number
270```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400271kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500272 kable_styling() %>%
273 add_footnote(c("Footnote 1", "Have a good day."), notation = "number")
274```
275
276### Symbol
277```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400278kable(dt, format = "latex", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500279 kable_styling() %>%
280 add_footnote(c("Footnote 1", "Footnote 2", "Footnote 3"), notation = "symbol")
281```
282
283## In-table markers
284By design, `add_footnote()` will transform any `[note]` to in-table footnote markers.
285```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400286kable(dt, format = "latex", caption = "Demo Table[note]", booktabs = T) %>%
Hao Zhu6a076462017-03-01 12:59:01 -0500287 kable_styling(latex_options = "hold_position") %>%
288 add_header_above(c(" ", "Group 1[note]" = 3, "Group 2[note]" = 3)) %>%
289 add_footnote(c("This table is from mtcars",
290 "Group 1 contains mpg, cyl and disp",
291 "Group 2 contains hp, drat and wt"),
292 notation = "symbol")
293```
Hao Zhuf9aa4c42017-05-22 15:53:35 -0400294
Hao Zhu0a0e8332017-08-03 13:21:29 -0400295# LaTeX Only Features
296## Table on a Landscape Page
Hao Zhu4278c632017-05-24 01:02:50 -0400297Sometimes when we have a wide table, we want it to sit on a designated landscape page. The new function `landscape()` can help you on that. Unlike other functions, this little function only serves LaTeX and doesn't have a HTML side.
298```{r}
Hao Zhu65216052017-09-04 13:01:29 -0400299kable(dt, format = "latex", caption = "Demo Table (Landscape)[note]", booktabs = T) %>%
Hao Zhu4278c632017-05-24 01:02:50 -0400300 kable_styling(latex_options = c("hold_position")) %>%
301 add_header_above(c(" ", "Group 1[note]" = 3, "Group 2[note]" = 3)) %>%
302 add_footnote(c("This table is from mtcars",
303 "Group 1 contains mpg, cyl and disp",
304 "Group 2 contains hp, drat and wt"),
305 notation = "symbol") %>%
306 group_rows("Group 1", 4, 5) %>%
307 landscape()
308```