NEWS

tidypolars 0.18.0

tidypolars requires polars >= 1.10.0.

New features

Added support for the following functions:
- anyNA() (#330)
- cummax() (#323)
- trunc() (#343)
Added decreasing and na.last arguments support to sort() (@Yousa-Mirage, #328).
Added na.last and ties.method arguments support to rank() (@Yousa-Mirage, #329).
Better error message in filter() when a condition uses = instead of == (#341).
count() and add_count() now work with expressions, e.g. count(mtcars, mpg + 1) (#346).
as_tibble() on grouped Polars DataFrames or LazyFrames now returns a grouped tibble (#348).

Bug fixes

Fix NA handling in cummin(), cumprod(), cumsum() (@Yousa-Mirage, #326).
Fix NA handling in is.finite(), is.infinite(), and is.nan() (#331).
In arrange(), if the data was grouped, the order was never maintained even if maintain_order = TRUE was passed in group_by(). This is now fixed (#332).
When exporting to CSV, null_values alone did not apply and could override explicitly provided null_value. This is now fixed (@Yousa-Mirage, #334).
Fix sample() to make it work correctly (@Yousa-Mirage, #338).
Fix unite() behavior when na.rm = TRUE (#344).
Fix a bug in fill() where groups set in .by would be preserved after the operation (hence returning a grouped output) (#348).

tidypolars 0.17.0

tidypolars requires polars >= 1.9.0 and dplyr >= 1.2.0.

Breaking changes and deprecations

The following functions (deprecated since 0.10.0, August 2024) are now removed (#303):
- describe(), use summary() instead.
- describe_plan() and describe_optimized_plan(), use explain(optimized = TRUE/FALSE) instead.
make_unique_id() is deprecated and will be removed in a future version. This is because the underlying Polars function isn't guaranteed to give the same results across different versions. This function doesn't have a replacement in tidypolars (#304).
In partition_by_key() and partition_by_max_size() (both already deprecated in 0.16.0), the argument per_partition_sort_by has been removed (#322).

New features

Added support for dplyr::near() (#311).
pivot_wider() now works with Polars LazyFrames (#318).
Added support for several functions implemented in dplyr 1.2.0:
- filter_out() (#280)
- recode_values() (#308)
- replace_values() (#308)
- replace_when() (#307)
- when_any() (#306)
- when_all() (#306)
separate() now supports regex in the sep argument (#320).

Other changes

Several changes to make tidypolars more aligned with the tidyverse output in general (#316):
- in count(), if sort = TRUE and there are some ties, then other variables are sorted in increasing order.
- coalesce() no longer has a default argument. This was an implementation mistake since dplyr::coalesce() never had this argument.
- ungroup() used to remove the group-specific attributes in the original grouped data, even if the result of the operation was not assigned. This is fixed.
- replace_na() on a Polars DataFrame or LazyFrame now errors if replacement is not a list.
- slice_*() functions on grouped data return columns in the same order as in the input.
- summarize() with only NULL expressions now returns one row per unique group instead of the entire data.
- unite() now returns columns in the correct order, and doesn't duplicate the sep in the output if some values are NA.

Bug fixes

bind_rows_polars() now uses input names in .id if not all inputs are named, for example bind_rows_polars(x1 = x1, x2, .id = "id") (#317).

tidypolars 0.16.0

tidypolars requires polars >= 1.8.0.

New features

New function unnest_longer_polars() to unnest list-columns into rows, equivalent to tidyr::unnest_longer(). It supports the parameters values_to, indices_to, keep_empty, as well as the {col} templates for column naming. (#212, #281, @Yousa-Mirage)
New functions separate_longer_delim_polars() and separate_longer_position_polars() to split string columns into rows by delimiter or fixed width, equivalent to tidyr::separate_longer_delim() and tidyr::separate_longer_position(). (#57, #285, @Yousa-Mirage)
New argument .by in fill() (this was introduced in tidyr 1.3.2). (#283)
wday() now supports arbitrary week_start values (1~7), allowing for custom week start days. (#292, @Yousa-Mirage)
Add support for argument type in nchar (#288).
It is now possible to use translated functions without loading the package they come from. For example, the following code can run without loading stringr in the session:
```
data |>
  mutate(y = .tp$str_extract_stringr(x, "\\d+"))
```
This can be useful to benefit from polars speed while using the interface of tidyverse functions, without adding additional tidyverse dependencies. This may be useful to avoid installing extra dependencies, but it is not the recommended usage because it makes it harder to convert tidypolars code to run with other tidyverse-based backends. More information with ?.tp (#293).
New argument mkdir in write_parquet_polars() (this already existed in sink_parquet()). (#298)
New (experimental) function partition_by() to write partitioned output in sink_*() and write_*_polars(). The following functions are deprecated and will be removed in a future release (#299):
- partition_by_key() can be replaced with partition_by(key =)
- partition_by_max_size() can be replaced with partition_by(max_rows_per_file =)

Changes

collect() now returns a tibble instead of a data.frame, for consistency with other collect() methods (#273).

Bug fixes

arrange() now works with literal values, such as arrange(x, 1:2) (#296).

Documentation

Removed the "FAQ" vignette, which was outdated and wasn't particularly helpful.

tidypolars 0.15.1

tidypolars requires polars >= 1.6.0.

tidypolars 0.15.0

Breaking changes

For consistency with dplyr, distinct() now only keeps the selected columns. To keep all columns, use .keep_all = TRUE (#227, @ppanko).

New features

New argument mkdir in all sink_*() functions to recursively create the folder(s) specified in the path(s) to files (#236).
New functions partition_by_key() and partition_by_max_size() that can be used in the path argument of sink_*() functions. Those enable writing a LazyFrame to several files as partitioned output. See more details in ?sink_parquet() (#237).
bind_cols_polars() now works with more than two LazyFrames (#244).
Add support for gsub() (#250).
Add partial support for stringr::str_equal() (#228).
Add support for lubridate functions rollbackward(), rollback(), and rollforward() (#252).
Support stringr::fixed() in more stringr functions (#250).
Add support for argument ignore.case in grepl() (#251).
Add support for argument .keep_all in distinct() (#227, @ppanko).

Bug fixes

Better error message in group_by() for unsupported argument .drop (#230).
Better error message in group_by() when passing named expressions in .... dplyr supports those but it is more and more recommended to use the .by / by argument in individual functions rather than using group_by() and ungroup() (#238).
Better error message in count() when passing named expressions in ... (#239).
Fix bug in join_where() when all common column names between two DataFrames are used in the join conditions (#254).
Using %in% with NA now retains the NA in the data. Using %in% NA will error (#256).
Remove occasional deprecation message coming from Polars when using %in% (#259, @ppanko).
Better handling of functions prefixed with <pkg>:: (#261).
Fix wrong behavior of paste() and paste0() with collapse (#263).

Documentation

New vignette "How to benchmark tidypolars" (#232).
Better documentation for all read_*() and scan_*() functions (#241).

tidypolars 0.14.1

tidypolars requires polars >= 1.1.0 (#222).

Bug fixes

Fix a corner case when filter() was used in a custom function with missing arguments (#220).
In grepl(), the argument fixed is now used correctly (thanks @gernophil for the report, #223).
if_else() and ifelse() now work when using named arguments (#224).

tidypolars 0.14.0

tidypolars requires polars >= 1.0.0. This release of polars contains many breaking changes. Those should be invisible to tidypolars users, with the exception of deprecation messages (see below). However, if your code contains user-defined functions that use polars syntax, you may need to revise those (#194).

Deprecations and breaking changes

The following arguments are deprecated and will be removed in a future version. The recommended replacement is indicated on the right of the arrow (#194):
- in compute() and collect(): streaming -> engine;
- in read_csv_polars() and scan_csv_polars():
  - dtypes -> schema_overrides
  - reuse_downloaded -> no replacement
- in read_ndjson_polars and scan_ndjson_polars():
  - reuse_downloaded -> no replacement
- in read_ipc_polars and scan_ipc_polars():
  - memory_map -> no replacement
- in write_csv_polars() and sink_csv():
  - null_values -> null_value
  - quote -> quote_char
- in write_ndjson_polars():
  - pretty -> no replacement
  - row_oriented -> no replacement
- in write_ipc_polars():
  - future -> compat_level
fetch() is deprecated, use head() before collect() instead (#194).
group_keys() now returns a tibble and not a data.frame anymore (#194).
lubridate::make_date(), lubridate::make_datetime(), and ISOdatetime() now error if some components go over their expected range, e.g. month = 20 or hour = 25. Before, those functions were returning NA in this situation (#194).
summary() returns an additional row for the 50% percentile (#194).

New features

Added support for various lubridate functions:
- force_tz() and with_tz() (@atsyplenkov, #170);
- date() (@atsyplenkov, #181);
- today() and now() (#183);
- weeks(), days(), hours(), minutes(), seconds(), milliseconds(), microseconds(), nanoseconds() (#184).
tidypolars can now use expressions that contain non-translated functions if those expressions do not use columns from the data.

Example:
```
dat <- pl$DataFrame(foo = c(2, 1, 2))
a <- c("d", "e", "f")
dat |>
  filter(foo >= agrep("a", a))
```
agrep() is not a translated function so this used to error:
```
Error in `filter()`:
! `tidypolars` doesn't know how to translate this function: `agrep()`.
```
However, we see that agrep("a", a) doesn't use any column but instead an object in the environment so it can be evaluated without caring whether tidypolars knows this function or not:
```
shape: (1, 1)
┌─────┐
│ foo │
│ --- │
│ f64 │
╞═════╡
│ 2.0 │
└─────┘
```
Note that this is evaluated before running polars in the background so this expression can't benefit from polars parallel evaluation for instance. Thanks @mgacc0 for the suggestion.
Add support for as.Date() for character columns (#190).
Error messages due to untranslated functions now suggest opening an issue to ask for their translation (#197).
Add support for %>% in expressions (#200).
Add support for dplyr::tally() (#203).
count() and add_count() now warn or error when argument wt is used since it is not supported. The behavior depends on the global option tidypolars_unknown_args (#204).
tidypolars has experimental support for fallback to R when a function is not internally translated to polars syntax. The default behavior is still to error, but the user can now set options(tidypolars_fallback_to_r = TRUE) to handle those unknown functions. See ?tidypolars_options for details on the drawbacks of this approach (#205).
Large performance improvement when using selection helpers (such as contains()) on data with many columns (#211).
tidypolars now exports rules to be used with flir for detecting deprecated functions describe_plan() and describe_optimized_plan(). Those can be used in your project by following this article. Note that this requires flir 0.5.0.9000 or higher (#214).

Bug fixes

Fix behavior of mutate() and summarize() when they don't contain any expression (#191).
Fix error in count() when it includes grouping variables (#193).
Passing . in an anonymous function in across() now works (#216).

tidypolars 0.13.0

New features

Added support for stringr::str_replace_na() (#153).
Better checks for unknown and unsupported arguments in compute(), collect(), *_join(), pivot_*(), sink_*(), slice_sample() and uncount()(#158, thanks @fkohrt for the report). Now, when those functions receive:
- an argument that exists in the tidyverse implementation but not supported by tidypolars, they warn the user. This default behaviour can be changed to error instead with options(tidypolars_unknown_args = "error").
- an argument that doesn't exist at all, they error.
Add support for argument explicit in tidyr::complete().
Add option to keep track of filenames in scan_csv_polars() (#171, @ginolhac).
Add partial support for seq() (argument length.out is not supported) and seq_len().
complete() now accepts named elements, e.g. complete(df, group, value = 1:4) (#176).
Add support for several lubridate functions:
- am(), pm(), leap_year(), days_in_month() (#178);

Bug fixes

Fix edge cases in the tidypolars implementation of stringr::str_sub() and substr() compared to their original implementation (#159).
arrange() now places NA values last, like dplyr.

tidypolars 0.12.0

tidypolars requires polars >= 0.21.0.

Breaking changes

summarize() now drops the last group of the output by default (for consistency with dplyr). Previously it kept the same groups as in the input data (#149).

New features

Add support for argument .groups in summarize(). Value "rowwise" is not supported for now (#149).
Added support for dplyr::lead(). In dplyr::lead() and dplyr::lag(), the arguments default and order_by are now supported (#151).

tidypolars 0.11.0

tidypolars requires polars >= 0.20.0.

Breaking changes

arrange() now errors with unknown variable names (like dplyr::arrange()). Previously, unknown variables were silently ignored. Using expressions (like a + b) is now accepted (#144).
The parameter inherit_optimization is removed from all sink_*() functions.

New features

The power operators ^ and ** now work.
New function sink_ndjson() to write the results of a lazy query to a NDJSON file without collecting it in memory.
inner_join() now accepts inequality joins in the by argument, including the following helpers: between(), overlaps(), within() (#148).

Bug fixes

Using an external object in case_when(), ifelse() and ifelse() now works.
str_sub() doesn't error anymore when start is positive and end is negative.
read_*_polars() functions used to return a standard data.frame by mistake. They now return a Polars DataFrame.
Using [ for subsetting in expressions now works. Thanks @ginolhac for the report (#141).
bind_cols_polars() and bind_rows_polars() now error (as expected before) if elements are a mix of Polars DataFrames and LazyFrames.

tidypolars 0.10.1

Bug fixes

Do not error when handling columns with datatype Null. Note that converting those columns to R with as.data.frame(), as_tibble(), or collect() is still an issue as of polars 0.19.1.

tidypolars 0.10.0

tidypolars requires polars >= 0.19.1.

Breaking changes and deprecations

describe() is deprecated as of tidypolars 0.10.0 and will be removed in a future update. Use summary() with the same arguments instead (#127).
describe_plan() and describe_optimized_plan() are deprecated as of tidypolars 0.10.0 and will be removed in a future update. Use explain() with optimized = TRUE/FALSE instead (#128).
In sink_parquet() and sink_csv(), all arguments except for .data and path must be named (#136).

New features

Add support for more functions:
- from package base: substr().
Better error message when a function can come from several packages but only one version is translated (#130).
row_number() now works without argument (#131).
New functions to import data as Polars DataFrames and LazyFrames (#136):
- read_<format>_polars() to import data as a Polars DataFrame;
- scan_<format>_polars() to import data as a Polars LazyFrame;
- <format> can be "csv", "ipc", "json", "parquet".
Those can replace functions from polars. For example, polars::pl$read_parquet(...) can be replaced by read_parquet_polars(...).
New functions to write Polars DataFrames to external files: write_<format>_polars() where <format> can be "csv", "ipc", "json", "ndjson", "parquet" (#136).
New function sink_ipc() that is similar to sink_parquet() and sink_csv() but for IPC files (#136).
across() now throws a better error message when the user passes an external list to .fns. This works with dplyr but cannot work with tidypolars (#135).
Added support for argument .add in group_by().

Bug fixes

stringr::str_sub() now works when both start and end are negative.
Fixed a bug in str_sub() when start was greater than 1.
stringr::str_starts() and stringr::str_ends() now work with a regex.
fill() doesn't error anymore when ... is empty. Instead, it returns the input data.
unite() now provides a proper error message when col is missing.
unite() doesn't error anymore when ... is empty. Instead, it uses all variables in the dataset.
filter(), mutate() and summarize() now work when using a column from another data.frame, e.g.
```
my_polars_df |>
  filter(x %in% some_data_frame$y)
```
replace_na() no longer converts the column to the datatype of the replacement, e.g. data |> replace_na("a") will error if the input data is numeric.
n_distinct() now correctly applies the na.rm argument when several columns are passed as input (#137).

tidypolars 0.9.0

tidypolars requires polars >= 0.18.0.

New features

Add support for several functions:
- from package base: %% and %/%.
- from package dplyr: dense_rank(), row_number().
- from package lubridate: wday().
Better handling of missing values to match R behavior. In the following functions, if there is at least one missing value and na.rm = FALSE (the default), then the output will be NA: max(), mean(), median(), min(), sd(), sum(), var() (#120).
New argument cluster_with_columns in collect(), compute(), and fetch().
Add a global option tidypolars_unknown_args to control what happens when tidypolars doesn't know how to handle an argument in a function. The default is to warn and the only other accepted value is "error".

Bug fixes

count() and add_count() no longer overwrite a variable named n if the argument name is unspecified.

tidypolars 0.8.0

tidypolars requires polars >= 0.17.0.

Breaking changes

As announced in tidypolars 0.7.0, the behavior of collect() has changed. It now returns a standard R data.frame and not a Polars DataFrame anymore. Replace collect() by compute() (with the same arguments) to keep the old behavior.
In bind_rows_polars(), if .id is passed, the resulting column now is of type character instead of integer.

New features

Add support for several functions:
- from package base: all(), any(), diff(), ISOdatetime(), length(), rev(), unique().
- from package dplyr: consecutive_id(), min_rank(), na_if(), n_distinct(), nth().
- from package lubridate: make_datetime().
- from package stringr: str_dup(), str_split(), str_split_i(), str_trunc().
- from package tidyr: replace_na() (the data.frame method was already translated but not the vector one that can be used in mutate() for example).
It is now possible to use explicit namespaces (such as dplyr::first() instead of first()) in mutate(), summarize() and filter() (#114).
In bind_rows_polars(), if all elements are named and .id is specified, the .id column will use the names of the elements (#116).
It is now possible to rename variables in select() (#117).
Add support for argument na_matches in all join functions (except cross_join() that doesn't need it) (#109).

Bug fixes

Local variables in custom functions could not be used in tidypolars functions (reported in a blog post of Art Steinmetz). This is now fixed.
across() now works when .cols contains only one variable and .fns contains only one function.

In across(), the .cols argument now takes into account variables created in the same mutate() or summarize() call before across().

as_polars_df(mtcars) |>
  head(n = 3) |>
  mutate(
    foo = 1,
    across(.cols = contains("oo"), \(x) x - 1)
  )

shape: (3, 12)
┌──────┬─────┬───────┬───────┬───┬─────┬──────┬──────┬─────┐
│ mpg  ┆ cyl ┆ disp  ┆ hp    ┆ … ┆ am  ┆ gear ┆ carb ┆ foo │
│ ---  ┆ --- ┆ ---   ┆ ---   ┆   ┆ --- ┆ ---  ┆ ---  ┆ --- │
│ f64  ┆ f64 ┆ f64   ┆ f64   ┆   ┆ f64 ┆ f64  ┆ f64  ┆ f64 │
╞══════╪═════╪═══════╪═══════╪═══╪═════╪══════╪══════╪═════╡
│ 21.0 ┆ 6.0 ┆ 160.0 ┆ 110.0 ┆ … ┆ 1.0 ┆ 4.0  ┆ 4.0  ┆ 0.0 │
│ 21.0 ┆ 6.0 ┆ 160.0 ┆ 110.0 ┆ … ┆ 1.0 ┆ 4.0  ┆ 4.0  ┆ 0.0 │
│ 22.8 ┆ 4.0 ┆ 108.0 ┆ 93.0  ┆ … ┆ 1.0 ┆ 4.0  ┆ 1.0  ┆ 0.0 │
└──────┴─────┴───────┴───────┴───┴─────┴──────┴──────┴─────┘

Note that the where() function is not supported here. For example:

as_polars_df(mtcars) |>
  mutate(
    foo = 1,
    across(.cols = where(is.numeric), \(x) x - 1)
  )

will not return 0 for the variable foo. A warning is emitted about this behavior.

Better handling of negative values in c() when called in mutate() and summarize().

tidypolars 0.7.0

tidypolars requires polars >= 0.16.0.

Breaking changes and deprecations

as_polars() is now removed. It was deprecated in 0.6.0. Use as_polars_df() or as_polars_lf() instead.
to_r() is now removed. It was deprecated in 0.6.0. Use as.data.frame() or as_tibble() instead.
For consistency with dplyr, the behavior of collect() will change in 0.8.0 as it will perform the lazy query and convert the result to a standard data.frame. For now, collect() only throws a warning about this future change. It is recommended to use compute() to only perform the query and get a Polars DataFrame as output (#101).

New features

Several improvements and changes for pivot_wider() (#95):
- names_from can now takes several variables;
- add support for id_cols and names_glue;
- default value of names_sep now is _, for consistency with tidyr;
- fix documentation as pivot_wider() doesn't work on LazyFrame.
Add support for stringr::regex(). Note that only the argument ignore_case is supported for now (#97).
Add support for several lubridate functions: dweeks(), ddays(), dhours(), dminutes(), dseconds(), dmilliseconds(), make_date() (#107).
When a polars function called internally fails, the original error message is now displayed.
Add support for group_split() (for DataFrame only).
Add support for argument relationship in left_join(), right_join(), full_join() and inner_join() (#106).

tidypolars 0.6.0

tidypolars requires polars >= 0.15.0.

Breaking changes and deprecations

as_polars() is deprecated and will be removed in 0.7.0. Use as_polars_lf() or as_polars_df() instead.
as_polars() doesn't have an argument with_string_cache anymore. When set to TRUE, this enabled the string cache globally, which could lead to undesirable side effects.
to_r() is deprecated and will be removed in 0.7.0. Use as.data.frame() or as_tibble() instead. This used to silently return a LazyFrame if the input was LazyFrame. It now automatically collects the LazyFrame (#88).
pull() nows automatically collects input LazyFrame (#89).

New features

Add support for argument .keep in mutate() (#80).
Add support for group_vars() and group_keys() (#81).
Experimental support of rowwise(). For now, this is limited to a few functions: mean(), median(), min(), max(), sum(), all(), any(). rowwise() and group_by() cannot be used at the same time (#40).
All functions that return a polars Data/LazyFrame now add the class "tidypolars" to the output (#86).
Support which.min(), which.max(), dplyr::n().
Support .data[[ and .env[[ in addition to .data$ and .env$. Better error messages when the objects specified in .data or .env don't exist.

Bug fixes

pull() now errors when var is of length > 1.

tidypolars 0.5.0

tidypolars requires polars >= 0.12.0.

Breaking changes

across() now errors if the argument .cols is not provided (either named or unnamed). This behavior was deprecated in dplyr 1.1.0.
It is no longer possible to use ! in arrange() to sort by decreasing order, for compatibility with dplyr::arrange(). Use - or desc() instead.

New features

summarize() now works on ungrouped data and returns a 1-row output.
It is now possible to use desc(x1) in arrange() to sort in decreasing order of x1 (this is equivalent to -x1).
Add support for argument names_prefix in pivot_longer().
Add support for arguments names_prefix and names_sep in pivot_wider().
Add support for tidyr::uncount().
All *_join() functions now work when by is a specification created by dplyr::join_by(). Notice that this is limited to equality joins for now.
You can now use the "embrace" operator {{ }} to pass unquoted column names (among other things) as arguments of custom functions. See the "Programming with dplyr" vignette for some examples.
bind_cols_polars() now works with two LazyFrames, but not more.
Add support for argument .name_repair in bind_cols_polars() (#74).
Support for .env$ and .data$ pronouns in expressions of filter(), mutate() and summarize().
Support named vector in the argument pattern of str_replace_all(), where names are patterns and values are replacements.
Using %in% for factor variables doesn't require enabling the string cache anymore.

Bug fixes

summarize() no longer errors when across(everything(), ...) is used with .by.
All *_join() functions no longer error when a named vector is provided in the argument by.
Expressions with values only are not named "literal" anymore.

Misc

Simplify the procedure to support new functions.

tidypolars 0.4.0

tidypolars requires polars >= 0.11.0.

Breaking changes

It is no longer possible to pass a list in rename().

New features

The argument with_string_cache in as_polars() now enables the string cache globally if set to TRUE (#54).
Better error message in filter() when comparing factors to strings while the string cache is disabled.
Basic support for strptime(). It is possible to use strptime(*, strict = FALSE) to not error when the parsing of some characters fails.
New argument .by in filter(), mutate(), and summarize(), and new argument by in the slice_*() functions. This allows to do operations on groups without using group_by() and ungroup(). See the dplyr vignette for more information (#59).
rename() now accepts unquoted names both old and new names.
Support fixed regexes in str_detect() (using fixed()) and in grepl() (using fixed = TRUE).

Bug fixes

Improve robustness of sequential expressions in mutate() and summarize() (i.e expressions that should be run one after the other because they depend on variables created in the same call) (#58).
relocate() now works correctly when .after = last_col().
All functions that work on grouped data now correctly restore the groups structure (#62).

Misc

Error messages coming from mutate(), summarize(), and filter() now give the right function call.
Faster tidy selection (#61).

tidypolars 0.3.0

tidypolars requires polars >= 0.10.0.

Breaking changes

All functions starting with pl_ have been removed to the benefit of the S3 methods. For example, pl_distinct() doesn't exist anymore so the only way to use it is to load dplyr and to use distinct() on a Polars DataFrame or LazyFrame. This is to avoid confusion about compatibility with dplyr and tidyr. See #49 for a more detailed explanation.
pl_bind_rows() and pl_bind_cols() are renamed bind_rows_polars() and bind_cols_polars() respectively. This is because bind_rows() and bind_cols() are not S3 methods (this might change in future versions of dplyr).

New features

New function duplicated_rows() that is the opposite of distinct() (#50).
New argument .id in bind_rows_polars().
bind_rows_polars() can now bind Data/LazyFrames that don't have the same schema. Columns will be upcast to common types if necessary. Unknown columns will be filled with NA.

Bug fixes

complete() now works correctly on grouped data.

Misc

relig_income and fish_encounters are not reexported anymore since tidyr is now imported.

tidypolars 0.2.0

tidypolars requires polars >= 0.9.0.

New features

Rename pl_fetch() to fetch().
New functions supported: describe(), sink_csv(), slice_sample().
New argument fill in pl_complete().
Support stringr::str_to_title() and tools::toTitleCase().
Support stringr::fixed() to use literal strings.
Support replacements with captured groups like \\1 in stringr::str_replace() and stringr::str_replace_all().

Bug fixes

sink_parquet() didn't use the user inputs (apart from the path).

Misc

Clearer error message when an expression contains <pkg>::. This is not supported for now but could potentially be implemented later.
pl_colnames() is no longer exported.

tidypolars 0.1.0

New features

Support as.numeric(), as.character(), as.logical(), grepl(), and paste() in expressions in pl_filter(), pl_mutate() and pl_summarize().
Support sink_parquet() (#38).
Support fetch() (#42).
Support for additional stringr functions: str_detect(), str_extract_all(), str_pad(), str_squish(), str_trim(), word() (some arguments or corner cases are not supported yet).
Add all optimization parameters in collect().

Bug fixes

Fix pl_mutate() and pl_summarize() when expressions use some variables previously created or modified (#10, #37).
Fix bug in pl_filter() when passing a vector in the RHS of %in%.

Misc

Improve the backend to translate R expressions into Polars expressions. This also led to a complete rewriting of the vignette "R and Polars expressions" (#27).
Error messages should now report the correct function call.
Improve CI coverage (#35).

tidypolars 0.0.1

First Github release.