GroupBy Example - Pandas Typing CST Test
Loading…
GroupBy Example — Pandas Code
Groups a DataFrame by a column and calculates aggregate statistics.
import pandas as pd
# Sample DataFrame
data = {'Department': ['HR','IT','HR','IT'], 'Salary':[50000,60000,55000,65000]}
df = pd.DataFrame(data)
# Group by department and get mean salary
grouped = df.groupby('Department')['Salary'].mean()
print(grouped)Pandas Language Guide
Pandas is an open-source Python library that provides high-performance, easy-to-use data structures and data analysis tools for working with structured (tabular, multidimensional, and time-series) data.
Primary Use Cases
- ▸Data cleaning, wrangling, and preprocessing
- ▸Exploratory data analysis (EDA) and statistics
- ▸Time-series analysis and financial data handling
- ▸Merging, joining, and reshaping datasets
- ▸Integration with visualization and ML frameworks
Notable Features
- ▸High-performance data structures (DataFrame, Series)
- ▸Label-based and integer-based indexing
- ▸Handling missing data with ease
- ▸Time-series functionality including date ranges and frequency conversion
- ▸Integration with CSV, Excel, SQL, JSON, and other file formats
Origin & Creator
Pandas was created by Wes McKinney in 2008 while he was at AQR Capital to provide a powerful tool for quantitative analysis and financial data processing in Python.
Industrial Note
Pandas is widely used in finance, data science, analytics, scientific computing, machine learning preprocessing, and any domain that requires structured data analysis.