Feature Selection Techniques – Recursive Feature Elimination (RFE)
300320201719 It is a greedy optimization algorithm which aims to find the best performing feature subset. It repeatedly creates models and keeps aside the best […]
300320201719 It is a greedy optimization algorithm which aims to find the best performing feature subset. It repeatedly creates models and keeps aside the best […]
300320201313 In backward elimination, we start with all the features and removes the least significant feature at each iteration which improves the performance of the […]
300320201248 Forward selection is an iterative method in which we start with no function in the model. In each iteration, we add a function that […]
290320202006 Collinearity is the state where two variables are highly correlated and contain similar information about the variance within a given dataset. The Variance Inflation […]
290320201454 In [1]: import numpy as np import pandas as pd import seaborn as sns import matplotlib.pyplot as plt from sklearn.preprocessing import LabelEncoder, OneHotEncoder import warnings […]
280320200940 Source of data: https://archive.ics.uci.edu/ml/datasets/Air+Quality In this case, statistical methods are used: We always have continuous and discrete variables in the data set. This procedure […]
categorical input – categorical output 260320201223 In this case, statistical methods are used: We always have continuous and discrete variables in the data set. This […]
230320200907 Principal component analysis (PCA) https://jakevdp.github.io/PythonDataScienceHandbook/05.08-random-forests.html https://www.geeksforgeeks.org/principal-component-analysis-with-python/ In [1]: import pandas as pd import numpy as np import seaborn as sns import matplotlib.pyplot as plt df= […]
200320200724 In [1]: import pandas as pd df = pd.read_csv(’/home/wojciech/Pulpit/1/kaggletrain.csv’) df = df.dropna(how=’any’) df.dtypes Out[1]: Unnamed: 0 int64 PassengerId int64 Survived int64 Pclass int64 Name […]
Przy budowie modelów ML występuje problrm dobrania najlepszych parametrów. Poniżej dowiemy się jak dobierać optymalne hiperparametry dla modelu. źródło: https://towardsdatascience.com/optimizing-hyperparameters-in-random-forest-classification-ec7741f9d3f6 https://chrisalbon.com/machine_learning/model_evaluation/plot_the_validation_curve/ In [1]: import numpy as […]
Copyright © 2026 | WordPress Theme by MH Themes