Skip to content

Commit c6d52d6

Browse files
authored
Merge branch 'main' into main
2 parents db8aff0 + f6cfd23 commit c6d52d6

File tree

5 files changed

+27
-2
lines changed

5 files changed

+27
-2
lines changed

README.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -47,11 +47,36 @@ data = pd.DataFrame({
4747
'PetalWidthCm':[0.2, 0.1, 0.2],
4848
'Species':['Iris-setosa','Iris-virginica', 'Iris-germanica']
4949
})
50+
51+
data_with_NA = pd.DataFrame({
52+
'SepalLengthCm':[5.1, 4.9, 4.7],
53+
'SepalWidthCm':[1.4, 1.4, 1.3],
54+
'PetalWidthCm':[0.2, 0.1, None]
55+
})
56+
57+
data_with_outlier = pd.DataFrame({
58+
'SepalLengthCm':[5.1, 4.9, 4.7, 5.2, 5.1, 5.2, 5.1, 4.8],
59+
'SepalWidthCm':[1.4, 1.4, 1.3, 1.2, 1.2, 1.3, 1.6, 1.3],
60+
'PetalWidthCm':[0.2, 0.1, 30, 0.2, 0.3, 0.1, 0.4, 0.5]
61+
})
5062
```
5163

5264
The eda_utils_py will help you to:
5365
- Diagnose data quality: Resolve skewed data by identifing missing data and outlier and provide corresponding remedy.
5466

67+
```python
68+
imputer(data_with_NA)
69+
```
70+
Output:
71+
72+
![imputer_output](images/imputer_output.png)
73+
74+
```python
75+
outlier_identifier(data_with_outlier, method = "median")
76+
```
77+
Output:
78+
79+
![outlier_output](images/outlier_output.png)
5580

5681
- This package can help you easily plot a correlation matrix along with its values to help explore data.
5782

eda_utils_py/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
__version__ = '0.1.6'
1+
__version__ = '0.1.7'

images/imputer_output.png

4 KB
Loading

images/outlier_output.png

8 KB
Loading

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
[tool.poetry]
22
name = "eda_utils_py"
3-
version = "0.1.6"
3+
version = "0.1.7"
44
description = "Python package that contains util functions for eda process"
55
authors = ["Chuang Wang <chuangw.sde@gmail.com>"]
66
license = "MIT"

0 commit comments

Comments
 (0)