When I apply across columns in this example I get a white space for positive valued numbers but not for negative values? Why is this? Shouldn\'t paste0 remove whitespace be
Since you are already using dplyr, a dplyr solution is to use
stop_latlon %>% rowwise() %>%
summarise(latlon = paste0(lat, "%7", lon))
# A tibble: 10 x 1
latlon
<chr>
1 via:1.222988975822%7-0.0916195541513781
2 via:0.159343465931011%72.13195314768885
3 via:-1.20468509249113%70.207717129395512
4 via:-0.134019685121819%7-0.912028913867691
5 via:-0.279895116522155%71.93812564387851
6 via:1.34379237820276%70.500525410068601
7 via:0.808272181619927%7-0.942578996972991
8 via:-1.17359899808855%70.126116638988962
9 via:1.1859602145711%7-1.00865269561505
10 via:1.77635906904826%70.685722866041471
Using a tibble
instead of a data.frame
by default will not convert your vector to a factor, which I think is desirable in this instance.
Additionally, regarding your question about paste0
, it does not remove whitespace between words, it just doesn't add them when concatenating. str_trim
in the stringr package will trim whitespace for you.
stop_latlon <- data.frame(lat = paste0("via:", rnorm(10)),
lon = rnorm(10), stringsAsFactors = FALSE)
library(stringr)
stop_latlon %>%
apply(1, function(x) paste0(str_trim(x), collapse = "%7"))
Will also provide the desired result.
paste0
doesn't add white space - nor does it remove it. You can test this by just calling paste0
on your vector.
apply
runs on matrices and arrays, not data frames. When you pass a data frame to apply
, it is coerced to a matrix. The main thing about a matrix, of course, is that all elements must be the same type. Since strings or factors can't generally be coerced to numerics, your numeric is coerced to a string or factor to match the first column. If you examine as.matrix.data.frame
, you'll see that format
is used for this conversion, and ?format
shows a default trim = FALSE
that says
trim
; ifFALSE
, logical, numeric and complex values are right-justified to a common width: ifTRUE
the leading blanks for justification are suppressed.
So there's your problem!
paste
and paste0
are vectorized, so there's no reason to apply
them one row at a time. You can just paste the columns together directly:
with(stop_latlon, paste0(lat, "%7", lon))
In a more complicated case where apply
really would be necessary, the solution would be to handle your own matrix conversion rather than relying on apply
to do it with defaults. If you made all the columns strings before passing the data to apply
, (or if you used a character matrix instead of a data frame), the conversion would be straightforward (or unnecessary).