Exploring Data Manipulation with Pandas in Python - Part 2
2. Data Structures: Series and DataFrames
Pandas introduces two fundamental data structures - Series and DataFrames. In this part, we'll delve deeper into Pandas Series, a versatile one-dimensional array-like structure that serves as the building block for more complex data manipulations.
Creating a Pandas Series:
To create a Series, you can pass a list or array-like object to the pd.Series()
constructor:
import pandas as pd
data = [10, 20, 30, 40, 50]
series = pd.Series(data)
print(series)
Labeling and Indexing:
Pandas Series come with an index that labels each element, allowing for efficient data retrieval. You can customize the index to match your data's context:
series = pd.Series(data, index=['A', 'B', 'C', 'D', 'E'])
print(series['B'])
Element-Wise Operations:
Series enable element-wise operations, making it easy to perform calculations across the entire Series:
temperature = pd.Series([25, 30, 28, 32, 27])
temperature_celsius = (temperature - 32) * 5 / 9
print(temperature_celsius)
Filtering and Boolean Indexing:
You can filter a Series based on conditions using boolean indexing:
above_average = temperature[temperature > 28]
print(above_average)
Handling Missing Data:
Pandas Series provide methods to handle missing data, such as dropna()
and fillna()
:
data = pd.Series([10, 20, None, 40, 50])
data_cleaned = data.dropna()
data_filled = data.fillna(0)
print(data_cleaned)
print(data_filled)
Exploring Series Attributes:
Series objects include useful attributes, such as values
and index
, which provide access to the underlying data and index labels:
print(series.values)
print(series.index)
Conclusion:
Pandas Series are a fundamental building block for data manipulation in Pandas. With their flexible indexing, element-wise operations, and seamless integration with other Pandas functionalities, Series empower you to efficiently work with one-dimensional data. In the next part of this series, we'll dive into the more complex counterpart - DataFrames - and explore their versatile capabilities in data analysis and manipulation.
0 Comments