Home > Backend Development > Python Tutorial > How Can I Create Conditional Columns in a DataFrame Based on Existing Column Values?

How Can I Create Conditional Columns in a DataFrame Based on Existing Column Values?

Barbara Streisand
Release: 2024-12-21 07:27:09
Original
729 people have browsed it

How Can I Create Conditional Columns in a DataFrame Based on Existing Column Values?

Creating Conditional Columns Based on Existing Column Values

In data analysis, it's often necessary to create new columns whose values are determined based on conditions derived from existing columns. Consider the scenario where you have a DataFrame with two columns: "Type" and "Set," and you want to add a new column called "color" that follows specific rules.

Adding a Color Column Based on Set Values

To create a "color" column where the values are "green" if "Set" is "Z" and "red" otherwise, you can use the following approach:

1

2

3

import numpy as np

 

df['color'] = np.where(df['Set'] == 'Z', 'green', 'red')

Copy after login

This code utilizes the np.where function, which selects values based on a condition. If the "Set" column value is "Z," the "color" value becomes "green"; otherwise, it becomes "red."

Using np.select for More Complex Conditions

For more complex scenarios where you have multiple conditions, you can use np.select. For instance, suppose you want to assign colors according to the following rules:

  • "yellow" if both "Set" is "Z" and "Type" is "A"
  • "blue" if "Set" is "Z" and "Type" is "B"
  • "purple" if "Type" is "B"
  • "black" otherwise

1

2

3

4

5

6

conditions = [

    (df['Set'] == 'Z') & (df['Type'] == 'A'),

    (df['Set'] == 'Z') & (df['Type'] == 'B'),

    (df['Type'] == 'B')]

choices = ['yellow', 'blue', 'purple']

df['color'] = np.select(conditions, choices, default='black')

Copy after login

The np.select function takes a list of conditions and a corresponding list of choices. If the condition is met, the associated choice is selected; otherwise, the default value is used.

These methods provide versatile options for creating conditional columns based on existing column values, allowing you to manipulate and analyze your data efficiently.

The above is the detailed content of How Can I Create Conditional Columns in a DataFrame Based on Existing Column Values?. For more information, please follow other related articles on the PHP Chinese website!

source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Latest Articles by Author
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template