Python exercise [03_1_encoding_scaling]: xgboost python is about to support categorical data while R hasn't yet.

> However, it says "the feature is experimental and has limited features. Only the Python package is fully supported". So in R it's not supported and mlr3 also does not support this as of now Afaik. So would propose to keep it to be for now but open an issue that needs to be resolved in future when this categorical support is not experimental anymore. Also it looks like it's basically internally doing one hot encoding: "categorical data the split is defined depending on whether partitioning or onehot encoding is used. For partition-based splits, the splits are specified as 
value $\in$ categories , where categories is the set of categories in one feature. If onehot encoding is used instead, then the split is defined as value == category "

_Originally posted by @giuseppec in https://github.com/slds-lmu/lecture_appml/pull/26#discussion_r2092388934_
            

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Python exercise [03_1_encoding_scaling]: xgboost python is about to support categorical data while R hasn't yet. #35

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Python exercise [03_1_encoding_scaling]: xgboost python is about to support categorical data while R hasn't yet. #35

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions