Gang Of Coders
All Dataframe Solutions on Gang of Coders
Total of 619 Dataframe Solutions
How to change the order of DataFrame columns?
Python
Pandas
Dataframe
How to delete multiple pandas (python) dataframes from memory to save RAM?
Python
Memory Management
Pandas
Dataframe
Ram
Efficient way to unnest (explode) multiple list columns in a pandas DataFrame
Python
Json
Pandas
Dataframe
Pandas Explode
Add empty columns to a dataframe with specified names from a vector
R
Dataframe
Simultaneously merge multiple data.frames in a list
R
List
Merge
Dataframe
R Faq
Split (explode) pandas dataframe string entry to separate rows
Python
Pandas
Numpy
Dataframe
Detect and exclude outliers in a pandas DataFrame
Python
Pandas
Filtering
Dataframe
Outliers
Using Pandas to pd.read_excel() for multiple worksheets of the same workbook
Python
Excel
Pandas
Dataframe
Xlsx
Difference between DataFrame, Dataset, and RDD in Spark
Dataframe
Apache Spark
Apache Spark-Sql
Rdd
Apache Spark-Dataset
pandas - find first occurrence
Python
Pandas
Dataframe
Group By
Find
How do I tell if a column in a pandas dataframe is of type datetime? How do I tell if a column is numerical?
Python
Pandas
Numpy
Dataframe
Python Pandas iterate over rows and access column names
Python
Pandas
Dataframe
Series
Store numpy.array in cells of a Pandas.DataFrame
Python
Pandas
Numpy
Dataframe
Count occurrences of False or True in a column in pandas
Python
Pandas
Dataframe
Count
Specifying row names when reading in a file
R
Csv
Dataframe
Rowname
Create dataframe from a matrix
R
Matrix
Dataframe
Add new column in Pandas DataFrame Python
Python
Pandas
Dataframe
Update a dataframe in pandas while iterating row by row
Python
Pandas
Updates
Dataframe
python pandas- apply function with two arguments to columns
Python
Function
Pandas
Dataframe
Extracting the first day of month of a datetime type column in pandas
Python
Pandas
Dataframe
Datetime64
Add column to dataframe with constant value
Python
Pandas
Dataframe
How to take column-slices of dataframe in pandas
Python
Pandas
Numpy
Dataframe
Slice
Adding a column to a dataframe in R
R
Dataframe
Converting a data frame to xts
R
Dataframe
Coerce
Xts
Time Series
Practical limits of R data frame
R
Performance
Dataframe
Rcpp
Select the first and last row by group in a data frame
R
Dataframe
Aggregate
How do I change a single value in a data.frame?
R
Dataframe
Cell
python pandas dataframe slicing by date conditions
Python
Dataframe
Pandas
How to convert a data frame column to numeric type?
R
Dataframe
Type Conversion
Split data frame string column into multiple columns
R
String
Dataframe
Split
R Faq
python pandas replacing strings in dataframe with numbers
Python
Replace
Dataframe
Pandas
Sorting by absolute value without changing the data
Python
Pandas
Sorting
Dataframe
How do I detect if a Spark DataFrame has a column
Scala
Apache Spark
Dataframe
Apache Spark-Sql
How to flatten a pandas dataframe with some columns as json?
Python
Json
Pandas
Dataframe
Flatten
Pandas - dataframe groupby - how to get sum of multiple columns
Python
Pandas
Dataframe
Pandas Groupby
How to split a dataframe string column into two columns?
Python
Dataframe
Pandas
Selecting columns in R data frame based on those *not* in a vector
R
Dataframe
Subset
Create an ID (row number) column
R
Dataframe
R Faq
Replace string/value in entire DataFrame
Python
Replace
Dataframe
Pandas
Replace value for a selected cell in pandas DataFrame without using index
Python
Pandas
Dataframe
Return row number(s) for a particular value in a column in a dataframe
R
Numbers
Dataframe
Row
Set MultiIndex of an existing DataFrame in pandas
Python
Pandas
Dataframe
Indexing
Multi Index
Append a column to Data Frame in Apache Spark 1.3
Scala
Apache Spark
Dataframe
USING LIKE inside pandas.query()
Python
Pandas
Dataframe
Replace values in a dataframe based on lookup table
R
Dataframe
Lookup
PySpark: multiple conditions in when clause
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
Create a set from a series in pandas
Python
Pandas
Dataframe
Series
Kaggle
Remove an entire column from a data.frame in R
R
Dataframe
How do I transpose dataframe in pandas without index?
Python
Pandas
Dataframe
Spark dataframe: collect() vs select()
Dataframe
Apache Spark
Apache Spark-Sql
Change column type in pandas
Python
Pandas
Dataframe
Types
Casting
Get a list from Pandas DataFrame column headers
Python
Pandas
Dataframe
Create a Pandas Dataframe by appending one row at a time
Python
Pandas
Dataframe
Append
How to add a new column to an existing DataFrame?
Python
Pandas
Dataframe
Chained Assignment
Convert Pandas column containing NaNs to dtype `int`
Python
Pandas
Dataframe
Dtype
Combine two data frames by rows (rbind) when they have different sets of columns
R
Dataframe
R Faq
Merge or combine by rownames
R
Merge
Dataframe
How to order a data frame by one descending and one ascending column?
R
Sorting
Dataframe
Getting imported json data into a data frame
Json
R
Import
Dataframe
How to add a cumulative column to an R dataframe using dplyr?
R
Dataframe
Dplyr
Calculate summary statistics of columns in dataframe
Python
Pandas
Csv
Dataframe
Profiling
list output truncated - How to expand listed variables with str() in R
R
Dataframe
Output
Truncated
DataFrame equality in Apache Spark
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Rdd
How do I check for equality using Spark Dataframe without SQL Query?
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Derive multiple columns from a single column in a Spark DataFrame
Scala
Apache Spark
Dataframe
Apache Spark-Sql
User Defined-Functions
Fillna in multiple columns in place in Python Pandas
Python
Pandas
Dataframe
Pandas - Slice large dataframe into chunks
Python
Pandas
Dataframe
Slice
Python pandas: how to remove nan and -inf values
Python
Python 3.x
Pandas
Numpy
Dataframe
Tilde sign in pandas DataFrame
Python
Pandas
Dataframe
How do I create a multiline plot using seaborn?
Python
Python 3.x
Dataframe
Plot
Seaborn
Find row where values for column is maximal in a pandas DataFrame
Python
Pandas
Dataframe
Row
Argmax
Replace characters from a column of a data frame R
R
Replace
Dataframe
Convert Python list to pandas Series
Python
List
Pandas
Dataframe
Series
Unimplemented type list when trying to write.table
R
Dataframe
write.table
dataframe.describe() suppress scientific notation
Python
Pandas
Dataframe
Remove first x number of characters from each row in a column of a Python dataframe
Python
String
Pandas
Dataframe
Replace
Python Pandas replace multiple columns zero to Nan
Python
Pandas
Dataframe
Data Cleaning
Python generating a list of dates between two dates
Python
Pandas
Dataframe
Find unique values in a Pandas dataframe, irrespective of row or column location
Python
Pandas
Dataframe
Pandas DataFrame to List of Dictionaries
Python
List
Dictionary
Pandas
Dataframe
Creating a pandas DataFrame from columns of other DataFrames with similar indexes
Python
Pandas
Dataframe
Max and Min date in pandas groupby
Python
Pandas
Dataframe
Convert Pandas Series to DateTime in a DataFrame
Python
Datetime
Pandas
Dataframe
Truncate `TimeStamp` column to hour precision in pandas `DataFrame`
Python
Pandas
Datetime
Dataframe
Error - replacement has [x] rows, data has [y]
R
Dataframe
subtract value from previous row by group
R
Dataframe
Lag
Filtering DataFrame using the length of a column
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
_corrupt_record error when reading a JSON file into Spark
Python
Json
Dataframe
Pyspark
Accessing every 1st element of Pandas DataFrame column containing lists
Python
Pandas
Dataframe
Pandas expand rows from list data available in column
Python
List
Pandas
Dataframe
Expand
concat series onto dataframe with column name
Pandas
Dataframe
Rename
Series
How to do a conditional count after groupby on a Pandas Dataframe?
Python
Pandas
Dataframe
Pandas Groupby
How do I delete rows in a data frame?
R
Dataframe
Row
Fast vectorized merge of list of data.frames by row
R
Performance
List
Merge
Dataframe
cbind a dataframe with an empty dataframe - cbind.fill?
R
Dataframe
Cbind
assign headers based on existing row in dataframe in R
R
Dataframe
Names
adding dummy columns to the original dataframe
Python
Pandas
Dataframe
One Hot-Encoding
Override column types when importing data using readr::read_csv() when there are many columns
R
Csv
File Io
Dataframe
Dplyr
how to read certain columns from Excel using Pandas - Python
Python
Numpy
Pandas
Dataframe
Fetching distinct values on a column using Spark DataFrame
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Spark Dataframe
Python Pandas update a dataframe value from another dataframe
Python
Pandas
Dataframe
How to drop rows of Pandas DataFrame whose value in a certain column is NaN
Python
Pandas
Dataframe
Nan
How to show full column content in a Spark Dataframe?
Apache Spark
Dataframe
Spark Csv
Output Formatting
Filtering Pandas DataFrames on dates
Python
Datetime
Pandas
Filtering
Dataframe
What is the difference between join and merge in Pandas?
Python
Pandas
Dataframe
Join
How to print pandas DataFrame without index
Python
Datetime
Pandas
Dataframe
How to deal with SettingWithCopyWarning in Pandas
Python
Pandas
Dataframe
Chained Assignment
Merge two dataframes by index
Python
Pandas
Dataframe
Merge
Concat
How to get rid of "Unnamed: 0" column in a pandas DataFrame read in from CSV file?
Python
Pandas
Csv
Dataframe
Python pandas Filtering out nan from a data selection of a column of strings
Python
Pandas
Dataframe
Pandas create empty DataFrame with only column names
Python
Pandas
Dataframe
Rename specific column(s) in pandas
Python
Pandas
Dataframe
Rename
Add x and y labels to a pandas plot
Python
Pandas
Dataframe
Matplotlib
Use a list of values to select rows from a Pandas dataframe
Python
Pandas
Dataframe
Pandas DataFrame Groupby two columns and get counts
Python
Pandas
Dataframe
Select rows in pandas MultiIndex DataFrame
Python
Pandas
Dataframe
Slice
Multi Index
Why isn't my Pandas 'apply' function referencing multiple columns working?
Python
Python 2.7
Pandas
Dataframe
Apply
Logical operators for Boolean indexing in Pandas
Python
Pandas
Dataframe
Boolean
Filtering
Convert list of dictionaries to a pandas DataFrame
Python
Dictionary
Pandas
Dataframe
How to filter rows containing a string pattern from a Pandas dataframe
Python
Pandas
Dataframe
How do I make a list of data frames?
R
List
Dataframe
R Faq
How to display pandas DataFrame of floats using a format string for columns?
Python
Python 2.7
Pandas
Ipython
Dataframe
Pandas DataFrame: replace all values in a column, based on condition
Python
Pandas
Dataframe
How to show all columns' names on a large pandas dataframe?
Python
Pandas
Dataframe
Options
How can I map True/False to 1/0 in a Pandas DataFrame?
Python
Pandas
Dataframe
Numpy
Boolean
Remove rows with all or some NAs (missing values) in data.frame
R
Dataframe
Filter
Missing Data
R Faq
Pretty-print an entire Pandas Series / DataFrame
Python
Pandas
Dataframe
Drop data frame columns by name
R
Dataframe
R Faq
How to loop over grouped Pandas dataframe?
Python
Pandas
Dataframe
Iteration
Pandas Groupby
How to find which columns contain any NaN value in Pandas dataframe
Python
Pandas
Dataframe
Nan
How do I retrieve the number of columns in a Pandas data frame?
Python
Pandas
Dataframe
Replacing blank values (white space) with NaN in pandas
Python
Pandas
Dataframe
Concatenate a list of pandas dataframes together
Python
Pandas
Dataframe
Concat
Writing a pandas DataFrame to CSV file
Python
Csv
Pandas
Dataframe
How to split data into 3 sets (train, validation and test)?
Pandas
Numpy
Dataframe
Machine Learning
Scikit Learn
How to add multiple columns to pandas dataframe in one assignment?
Python
Pandas
Dataframe
Pandas dataframe get first row of each group
Python
Pandas
Dataframe
Group By
Row
How to get the last N rows of a pandas DataFrame?
Python
Pandas
Dataframe
Compare two DataFrames and output their differences side-by-side
Python
Pandas
Dataframe
Pandas dataframe fillna() only some columns in place
Python
Pandas
Dataframe
Reshaping data.frame from wide to long format
R
Dataframe
Reshape
R Faq
How to create a dictionary of two pandas DataFrame columns
Python
Dictionary
Pandas
Dataframe
Numbering rows within groups in a data frame
R
Dataframe
R Faq
Rename Pandas DataFrame Index
Python
Pandas
Dataframe
Rename
Turn Pandas Multi-Index into column
Python
Pandas
Dataframe
Flatten
Multi Index
How to replace NaNs by preceding or next values in pandas DataFrame?
Python
Python 3.x
Pandas
Dataframe
Nan
python dataframe pandas drop column using int
Python
Pandas
Dataframe
Multiple aggregations of the same column using pandas GroupBy.agg()
Python
Pandas
Dataframe
Aggregate
Pandas Groupby
Find column whose name contains a specific string
Python
Python 3.x
String
Pandas
Dataframe
Can pandas automatically read dates from a CSV file?
Python
Date
Types
Dataframe
Pandas
How do I replace NA values with zeros in an R dataframe?
R
Dataframe
Find the unique values in a column and then sort them
Python
Pandas
Sorting
Dataframe
Unique
Determine the data types of a data frame's columns
R
Dataframe
Types
datetime dtypes in pandas read_csv
Python
Csv
Datetime
Pandas
Dataframe
pandas dataframe columns scaling with sklearn
Python
Pandas
Scikit Learn
Dataframe
Python Pandas - Find difference between two data frames
Python
Pandas
Dataframe
Convert data.frame column to a vector?
R
Dataframe
Vector
Type Conversion
Add missing dates to pandas dataframe
Python
Date
Plot
Pandas
Dataframe
How to test if a string contains one of the substrings in a list, in pandas?
Python
String
Pandas
Dataframe
Match
add a string prefix to each value in a string column using Pandas
Python
String
Pandas
Dataframe
How to iterate over rows in a DataFrame in Pandas
Python
Pandas
Dataframe
How to convert a table to a data frame
R
Dataframe
Pandas: sum DataFrame rows for given columns
Python
Pandas
Dataframe
Sum
Convert row names into first column
R
Dataframe
Col
Rowname
How to form tuple column from two columns in Pandas
Python
Dataframe
Pandas
Tuples
For each row in an R dataframe
R
Dataframe
Rows
Order data frame rows according to vector with specific order
R
Sorting
Dataframe
Remove unwanted parts from strings in a column
Python
String
Pandas
Dataframe
How to combine multiple conditions to subset a data-frame using "OR"?
R
Conditional
Dataframe
How to access pandas groupby dataframe by key
Python
Pandas
Dataframe
Group By
Pandas Groupby
Compare two data.frames to find the rows in data.frame 1 that are not present in data.frame 2
R
Merge
Compare
Rows
Dataframe
Pretty Printing a pandas dataframe
Python
Pandas
Dataframe
Printing
getting the index of a row in a pandas apply function
Python
Pandas
Dataframe
How to unnest (explode) a column in a pandas DataFrame, into multiple rows
Python
Pandas
Dataframe
Pandas Explode
extract column value based on another column pandas dataframe
Python
Pandas
Dataframe
Repeat each row of data.frame the number of times specified in a column
R
Dataframe
Replicate
Call apply-like function on each row of dataframe with multiple arguments from each row
R
Dataframe
How are iloc and loc different?
Python
Pandas
Dataframe
Indexing
Pandas Loc
Combine two columns of text in pandas dataframe
Python
Pandas
Dataframe
How to add a constant column in a Spark DataFrame?
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
How to read a .xlsx file using the pandas Library in iPython?
Python
Pandas
Ipython
Ipython Notebook
Dataframe
Return multiple columns from pandas apply()
Python
Pandas
Dataframe
Apply
Determine the number of NA values in a column
R
Dataframe
How to add a row to a data frame in R?
R
Dataframe
How to select the first row of each group?
Sql
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Convert data.frame column format from character to factor
R
Dataframe
Character
R Faq
pandas unique values multiple columns
Python
Pandas
Dataframe
Unique
Find the column name which has the maximum value for each row
Python
Pandas
Dataframe
Max
Remove columns from dataframe where ALL values are NA
R
Apply
Dataframe
Aggregate / summarize multiple variables per group (e.g. sum, mean)
R
Dataframe
data.table
Aggregate
R Faq
Filter data.frame rows by a logical condition
R
Dataframe
Subset
R Faq
Drop columns whose name contains a specific string from pandas DataFrame
Python
Pandas
Dataframe
Insert a row to pandas dataframe
Python
Pandas
Dataframe
Insert
How to replace text in a string column of a Pandas dataframe?
Python
Replace
Pandas
Dataframe
Save Dataframe to csv directly to s3 Python
Python
Csv
Amazon S3
Dataframe
Boto3
Fastest way to replace NAs in a large data.table
R
Performance
Dataframe
data.table
Add new row to dataframe, at specific row-index, not appended?
R
Dataframe
Insert
Find maximum value of a column and return the corresponding row values using Pandas
Python
Pandas
Dataframe
Max
Convert unix time to readable date in pandas dataframe
Python
Pandas
Unix Timestamp
Dataframe
Get total of Pandas column
Python
Pandas
Dataframe
Sum
How do I add a new column to a Spark DataFrame (using PySpark)?
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
How to get the first column of a pandas DataFrame as a Series?
Python
Dataframe
Pandas
Series
Concatenate columns in Apache Spark DataFrame
Sql
Apache Spark
Dataframe
Apache Spark-Sql
Creating an empty Pandas DataFrame, then filling it?
Python
Dataframe
Pandas
How to create a DataFrame of random integers with Pandas?
Python
Pandas
Dataframe
Size
Shapes
data.frame rows to a list
List
R
Dataframe
Assign multiple columns using := in data.table, by group
R
Dataframe
data.table
Variable Assignment
Colon Equals
Python pandas: fill a dataframe row by row
Python
Dataframe
Row
Pandas
Dynamically select data frame columns using $ and a character value
R
Dataframe
R Faq
Filter pandas DataFrame by substring criteria
Python
String
Pandas
Dataframe
Should I use a data.frame or a matrix?
R
Matrix
Dataframe
R Faq
What rules does Pandas use to generate a view vs a copy?
Python
Pandas
Dataframe
Indexing
Chained Assignment
How to "select distinct" across multiple data frame columns in pandas?
Python
Pandas
Dataframe
Duplicates
Distinct
Omit rows containing specific column of NA
R
Dataframe
Na
How to get a value from a Pandas DataFrame and not the index and object type
Python
Pandas
Dataframe
What is dtype('O'), in pandas?
Python
Pandas
Numpy
Dataframe
Types
Binning a column with Python Pandas
Python
Pandas
Numpy
Dataframe
Binning
How to convert index of a pandas dataframe into a column
Python
Pandas
Dataframe
Indexing
Series
How to count the NaN values in a column in pandas DataFrame
Python
Pandas
Dataframe
Shuffle DataFrame rows
Python
Pandas
Dataframe
Permutation
Shuffle
How to filter Pandas dataframe using 'in' and 'not in' like in SQL
Python
Pandas
Dataframe
Sql Function
R - Concatenate two dataframes?
R
Dataframe
Concatenation
How to save a data.frame in R?
R
Dataframe
pandas: How do I split text in a column into multiple rows?
Python
Pandas
Dataframe
Appending a list or series to a pandas DataFrame as a row?
Python
Pandas
Append
Dataframe
Making heatmap from pandas DataFrame
Python
Pandas
Dataframe
Heatmap
Convert row to column header for Pandas DataFrame
Python
Pandas
Rename
Dataframe
How to convert a dataframe to a dictionary
Python
Pandas
Dataframe
Dictionary
Truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all()
Python
Pandas
Dataframe
Boolean
Filtering
Set value for particular cell in pandas DataFrame using index
Python
Pandas
Dataframe
Cell
Nan
Get statistics for each group (such as count, mean, etc) using pandas GroupBy?
Python
Pandas
Dataframe
Group By
Pandas Groupby
Python Pandas replace NaN in one column with value from corresponding row of second column
Python
Pandas
Dataframe
Nan
Fillna
Pass a data.frame column name to a function
R
Dataframe
R Faq
Subset data frame based on multiple conditions
R
Dataframe
Subset
How to select all columns whose names start with X in a pandas DataFrame
Python
Pandas
Dataframe
Selection
How can I make pandas dataframe column headers all lowercase?
Python
Pandas
Dataframe
Compare two columns using pandas
Python
Pandas
If Statement
Dataframe
Filter Pyspark dataframe column with None value
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
How do I select rows from a DataFrame based on column values?
Python
Pandas
Dataframe
Renaming column names in Pandas
Python
Pandas
Replace
Dataframe
Rename
How to succinctly write a formula with many variables from a data frame?
R
Dataframe
Glm
Lm
How to create empty data frame with column names specified in R?
R
Dataframe
pandas how to check dtype for all columns in a dataframe?
Python
Pandas
Dataframe
How to select rows with NaN in particular column?
Python
Pandas
Dataframe
Reshape three column data frame to matrix ("long" to "wide" format)
R
Matrix
Dataframe
Plyr
Reshape
Count number of rows within each group
R
Dataframe
Aggregate
R Faq
Find the max of two or more columns with pandas
Python
Dataframe
Pandas
Python pandas insert list into a cell
Python
List
Pandas
Insert
Dataframe
Python: pandas merge multiple dataframes
Python
Pandas
Dataframe
Merge
Data Analysis
Display all dataframe columns in a Jupyter Python Notebook
Python
Python 3.x
Dataframe
Jupyter Notebook
Import multiple csv files into pandas and concatenate into one DataFrame
Python
Pandas
Csv
Dataframe
Concatenation
Add column in dataframe from list
Python
Pandas
Dataframe
Convert a row of a data frame to vector
R
Vector
Dataframe
How to define partitioning of DataFrame?
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Partitioning
Split delimited strings in a column and insert as new rows
R
Dataframe
Reshape
Data Manipulation
Strsplit
Combine two or more columns in a dataframe into a new column with a new name
R
Dataframe
Multiple Columns
R Faq
Convert pandas dataframe to NumPy array
Python
Arrays
Pandas
Numpy
Dataframe
python pandas dataframe columns convert to dict key and value
Python
Pandas
Dataframe
Dictionary
Data Conversion
Converting a column within pandas dataframe from int to string
Python
String
Pandas
Dataframe
Int
What values are valid in Pandas 'Freq' tags?
Python
Pandas
Documentation
Dataframe
Frequency
Creating a zero-filled pandas data frame
Python
Pandas
Dataframe
Change value of variable with dplyr
R
Dataframe
Plyr
Dplyr
Drop rows containing empty cells from a pandas DataFrame
Python
Pandas
Dataframe
Drop
Replace None with NaN in pandas dataframe
Pandas
Dataframe
Replace
Nan
Nonetype
Move column by name to front of table in pandas
Python
Pandas
Move
Dataframe
Shift
How to find the size or shape of a DataFrame in PySpark?
Python
Dataframe
Pyspark
Select the row with the maximum value in each group
R
Dataframe
R Faq
Convert pandas data frame to series
Python
Pandas
Dataframe
Series
Filtering a data frame by values in a column
R
Filter
Dataframe
How can I split a column of tuples in a Pandas dataframe?
Python
Numpy
Pandas
Dataframe
Tuples
How to change a dataframe column from String type to Double type in PySpark?
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
The difference between bracket [ ] and double bracket [[ ]] for accessing the elements of a list or dataframe
R
List
Dataframe
Extract
R Faq
Converting a Pandas GroupBy output from Series to DataFrame
Python
Pandas
Dataframe
Pandas Groupby
Multi Index
Difference between map, applymap and apply methods in Pandas
Python
Pandas
Dataframe
Vectorization
How to check if any value is NaN in a Pandas DataFrame
Python
Pandas
Dataframe
Nan
Create empty data frame with column names by assigning a string vector?
R
Dataframe
Strip / trim all strings of a dataframe
Python
Regex
Pandas
Dataframe
Trim
What is the difference between using loc and using just square brackets to filter for columns in Pandas/Python?
Python
Pandas
Dataframe
Convert pandas Series to DataFrame
Python
Pandas
Dataframe
Series
Constructing pandas DataFrame from values in variables gives "ValueError: If using all scalar values, you must pass an index"
Python
Pandas
Dataframe
Scalar
Convert a list to a data frame
R
List
Dataframe
How to apply a function to two columns of Pandas dataframe
Python
Pandas
Dataframe
How do I combine two data-frames based on two columns?
R
Merge
Dataframe
Select first 4 rows of a data.frame in R
R
Dataframe
How to calculate the number of occurrence of a given character in each row of a column of strings?
Regex
R
Dataframe
Joining pandas DataFrames by Column names
Python
Pandas
Dataframe
How to add a suffix (or prefix) to each column name?
Python
Pandas
Dataframe
Convert List to Pandas Dataframe Column
Python
List
Pandas
Dataframe
Is there a way in Pandas to use previous row value in dataframe.apply when previous value is also calculated in the apply?
Python
Pandas
Dataframe
For Loop
Iteration
Apply function to each cell in DataFrame
Python
Pandas
Dataframe
Apply
UnicodeDecodeError when reading CSV file in Pandas with Python
Python
Pandas
Csv
Dataframe
Unicode
How to get a value from a cell of a dataframe?
Python
Pandas
Dataframe
How to shift a column in Pandas DataFrame
Python
Pandas
Dataframe
'DataFrame' object has no attribute 'sort'
Python
Pandas
Numpy
Dataframe
Split column at delimiter in data frame
R
Dataframe
selecting from multi-index pandas
Python
Pandas
Dataframe
Multi Index
How to append rows to an R data frame
R
Merge
Append
Dataframe
Rows
Trying to merge 2 dataframes but get ValueError
Python
Pandas
Dataframe
Drop unused factor levels in a subsetted data frame
R
Dataframe
R Factor
R Faq
Transpose a data frame
R
Dataframe
Create a group number for each consecutive sequence
R
Dataframe
Sequence
Construct pandas DataFrame from items in nested dictionary
Python
Pandas
Dataframe
Multi Index
Way to read first few lines for pandas dataframe
Python
Pandas
Csv
Dataframe
Python: get a frequency count based on two columns (variables) in pandas dataframe some row appears
Python
Pandas
Group By
Dataframe
How to read a Parquet file into Pandas DataFrame?
Python
Pandas
Dataframe
Parquet
Blaze
How to replace NaN values by Zeroes in a column of a Pandas Dataframe?
Python
Pandas
Dataframe
Nan
Import CSV file as a pandas DataFrame
Python
Pandas
Csv
Dataframe
String concatenation of two pandas columns
Python
String
Pandas
Numpy
Dataframe
Pandas selecting by label sometimes return Series, sometimes returns DataFrame
Python
Pandas
Dataframe
Slice
Series
Annotate bars with values on Pandas bar plots
Python
Matplotlib
Plot
Pandas
Dataframe
Create an empty data.frame
R
Dataframe
R Faq
Splitting dataframe into multiple dataframes
Python
Split
Pandas
Dataframe
Comparing two dataframes and getting the differences
Python
Pandas
Dataframe
Pandas split DataFrame by column value
Python
Pandas
Dataframe
Indexing
Split
Creating an R dataframe row-by-row
List
R
Dataframe
Display rows with one or more NaN values in pandas dataframe
Python
Pandas
Dataframe
Nan
How to delete all columns in DataFrame except certain ones?
Python
Pandas
Dataframe
Pandas DataFrame stored list as string: How to convert back to list
Python
String
List
Pandas
Dataframe
Replacing few values in a pandas dataframe column with another value
Python
Replace
Pandas
Dataframe
Spark Dataframe distinguish columns with duplicated name
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
How to get row from R data.frame
R
Indexing
Dataframe
Cleaning `Inf` values from an R dataframe
R
Dataframe
data.table
Spark DataFrame groupBy and sort in the descending order (pyspark)
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
Quickly reading very large tables as dataframes
R
Import
Dataframe
R Faq
Python Pandas: Convert ".value_counts" output to dataframe
Python
Pandas
Dataframe
Repeat rows of a data.frame
R
Dataframe
Rows
Repeat
Add (insert) a column between two columns in a data.frame
R
Dataframe
Insert
Shift column in pandas dataframe up by one?
Python
Pandas
Dataframe
Filter dataframe rows if value in column is in a set list of values
Python
Pandas
Dataframe
Subset of rows containing NA (missing) values in a chosen column of a data frame
R
Csv
Dataframe
Subset
Na
Apply Function on DataFrame Index
Python
Pandas
Indexing
Dataframe
How to merge a Series and DataFrame
Python
Pandas
Dataframe
How to replace negative numbers in Pandas Data Frame by zero
Python
Pandas
Dataframe
Replace
Negative Number
Python pandas: how to specify data types when reading an Excel file?
Python
Pandas
Dataframe
How to plot two columns of a pandas data frame using points
Python
Matplotlib
Plot
Pandas
Dataframe
How to create an empty DataFrame with a specified schema?
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Selecting a row of pandas series/dataframe by integer index
Python
Pandas
Dataframe
Indexing
Combine two pandas Data Frames (join on a common column)
Python
Pandas
Dataframe
Merge
Left Join
Rename multiple columns by names
R
Dataframe
Rename
R Faq
Add an index (numeric ID) column to large data frame
R
Dataframe
Get first element of Series without knowing the index
Python
Pandas
Dataframe
Series
Head
Filtering Pandas Dataframe using OR statement
Python
Pandas
Filter
Dataframe
Count number of non-NaN entries in every column of Dataframe
Python
Pandas
Dataframe
Count
Nan
Remove Unnamed columns in pandas dataframe
Python
Pandas
Dataframe
How to check whether a pandas DataFrame is empty?
Python
Pandas
Dataframe
Replace invalid values with None in Pandas DataFrame
Python
Pandas
Dataframe
Replace
Nan
Group dataframe and get sum AND count?
Python
Pandas
Dataframe
Group By
Pandas Groupby
How do you remove the column name row when exporting a pandas DataFrame?
Python
Pandas
Csv
Dataframe
Header
How to explode a list inside a Dataframe cell into separate rows
Python
Pandas
Dataframe
Python Pandas How to assign groupby operation results back to columns in parent dataframe?
Python
Group By
Dataframe
Pandas
Select the first row by group
R
Dataframe
Sqldf
Convert Named Character Vector to data.frame
R
Dataframe
Vector
Type Conversion
Replace all particular values in a data frame
R
Dataframe
Replace
Ambiguity in Pandas Dataframe / Numpy Array "axis" definition
Python
Arrays
Pandas
Numpy
Dataframe
How to export a table dataframe in PySpark to csv?
Python
Apache Spark
Dataframe
Apache Spark-Sql
Export to-Csv
Retrieve DataFrame of all but one specified column
Python
Pandas
Dataframe
R spreading multiple columns with tidyr
R
Dataframe
Dplyr
Tidyr
Renaming column names of a DataFrame in Spark Scala
Scala
Apache Spark
Dataframe
Apache Spark-Sql
How to make separator in pandas read_csv more flexible wrt whitespace, for irregular separators?
Python
Csv
Pandas
Dataframe
Whitespace
Split a large dataframe into a list of data frames based on common value in column
R
Performance
Matrix
Split
Dataframe
How to check if a column exists in Pandas
Python
Pandas
Dataframe
Move a column to first position in a data frame
R
Dataframe
Sum rows in data.frame or matrix
R
Dataframe
Matrix
Rowsum
Pandas version of rbind
Python
R
Dataframe
Pandas
Pandas - How to flatten a hierarchical index in columns
Python
Pandas
Dataframe
how to sort pandas dataframe from one column
Python
Pandas
Dataframe
Sorting
Time
pandas groupby without turning grouped by column into index
Python
Pandas
Dataframe
Repeat rows of a data.frame N times
R
Dataframe
How to delete columns that contain ONLY NAs?
R
Dataframe
Na
Join two data frames, select all columns from one and some columns from the other
Dataframe
Apache Spark
Pyspark
Apache Spark-Sql
How to reset index in a pandas dataframe?
Python
Indexing
Pandas
Dataframe
Convert Python dict into a dataframe
Python
Pandas
Dataframe
How to reorder indexed rows based on a list in Pandas data frame
Python
Pandas
Dataframe
How to append rows in a pandas dataframe in a for loop?
Python
For Loop
Pandas
Dataframe
append dictionary to data frame
Python
Python 3.x
Pandas
Dataframe
How do I create test and train samples from one dataframe with pandas?
Python
Python 2.7
Pandas
Dataframe
Pandas read_csv low_memory and dtype options
Python
Parsing
Numpy
Pandas
Dataframe
How to replace NA values in a table for selected columns
R
Replace
Dataframe
data.table
Na
Python Pandas replicate rows in dataframe
Python
Pandas
Dataframe
ValueError: Length of values does not match length of index | Pandas DataFrame.unique()
Python
Pandas
Dataframe
AttributeError: 'DataFrame' object has no attribute 'ix'
Python
Pandas
Dataframe
start index at 1 for Pandas DataFrame
Python
Pandas
Csv
Dataframe
Indexing
changing sort in value_counts
Python
Pandas
Dataframe
Lambda including if...elif...else
Python
Pandas
Dataframe
How can I use the apply() function for a single column?
Python
Pandas
Dataframe
move column in pandas dataframe
Python
Pandas
Dataframe
How to remove timezone from a Timestamp column in a pandas dataframe
Python
Pandas
Dataframe
Timezone
Timestamp with-Timezone
How to sum a variable by group
R
Dataframe
Aggregate
R Faq
Changing column names of a data frame
R
Dataframe
Rename
Updating a dataframe column in spark
Python
Dataframe
Apache Spark
Pyspark
Apache Spark-Sql
pandas - filter dataframe by another dataframe by row elements
Python
Pandas
Dataframe
Spark SQL: apply aggregate functions to a list of columns
Apache Spark
Dataframe
Apache Spark-Sql
Aggregate Functions
Merging dataframes on index with pandas
Python
Pandas
Merge
Dataframe
Create a data.frame where a column is a list
R
List
Dataframe
Selecting/excluding sets of columns in pandas
Python
Pandas
Dataframe
R Apply() function on specific dataframe columns
R
Dataframe
Apply
Concatenate rows of two dataframes in pandas
Python
Pandas
Dataframe
Renaming columns for PySpark DataFrame aggregates
Dataframe
Apache Spark
Pyspark
Apache Spark-Sql
Coerce multiple columns to factors at once
R
Dataframe
R Factor
Extract values in Pandas value_counts()
Python
Pandas
Dataframe
Series
Elegant way to create empty pandas DataFrame with NaN of type float
Python
Pandas
Numpy
Dataframe
Nan
substring of an entire column in pandas dataframe
Python
Pandas
Dataframe
Groupby value counts on the dataframe pandas
Python
Pandas
Dataframe
Crosstab
Pandas Groupby
Counting unique / distinct values by group in a data frame
R
Dataframe
Distinct Values
R Faq
Get current number of partitions of a DataFrame
Python
Scala
Dataframe
Apache Spark
Apache Spark-Sql
Forcing pandas .iloc to return a single-row dataframe?
Python
Pandas
Dataframe
Indexing
Pandas get the most frequent values of a column
Python
Pandas
Dataframe
How to plot all the columns of a data frame in R
R
Dataframe
Plot
R Faq
How to sum data.frame column values?
R
Dataframe
Sum
Aggregate Functions
Calculate the mean by group
R
Dataframe
R Faq
Removing display of row names from data frame
R
Printing
Dataframe
Output Formatting
Rowname
How to open and convert sqlite database to pandas dataframe
Python
Database
Sqlite
Pandas
Dataframe
pandas .at versus .loc
Python
Pandas
Dataframe
Pandas Groupby and Sum Only One Column
Python
Pandas
Dataframe
Pandas Groupby
Delete a column from a Pandas DataFrame
Python
Pandas
Dataframe
Pandas conditional creation of a series/dataframe column
Python
Pandas
Numpy
Dataframe
Combine a list of data frames into one data frame by row
R
List
Dataframe
R Faq
making matplotlib scatter plots from dataframes in Python's pandas
Python
Matplotlib
Plot
Dataframe
Pandas
Convert all data frame character columns to factors
R
Dataframe
Get the name of a pandas DataFrame
Python
Pandas
Dataframe
Attributes
Redefining the Index in a Pandas DataFrame object
Python
Pandas
Dataframe
How to check if a value is in the list in selection from pandas data frame?
Python
Select
Numpy
Pandas
Dataframe
Remove duplicates from dataframe, based on two columns A,B, keeping row with max value in another column C
Python
Pandas
Dataframe
Duplicates
Merge two data frames based on common column values in Pandas
Pandas
Dataframe
Extracting specific columns from a data frame
R
Dataframe
R Faq
Sorting columns in pandas dataframe based on column name
Python
Pandas
Dataframe
How to quickly form groups (quartiles, deciles, etc) by ordering column(s) in a data frame
R
Sorting
Dataframe
Elegant way to report missing values in a data.frame
R
Dataframe
Missing Data
Merging a lot of data.frames
R
Dataframe
Merge
Is there a way to copy only the structure (not the data) of a Pandas DataFrame?
Python
Pandas
Dataframe
How do I sum values in a column that match a given condition using pandas?
Python
Pandas
Dataframe
Data Analysis
Putting many python pandas dataframes to one excel worksheet
Python
Excel
Pandas
Dataframe
Xlsxwriter
Pandas: drop columns with all NaN's
Python
Pandas
Dataframe
In Place
Convert a dataframe to a vector (by rows)
R
Dataframe
Vector
R Faq
Elegant indexing up to end of vector/matrix
R
Matrix
Dataframe
Indexing
Finding common rows (intersection) in two Pandas dataframes
Python
Pandas
Dataframe
Intersect
Convert a data frame to a data.table without copy
R
Dataframe
Reference
data.table
dplyr::select one column and output as vector
R
Select
Vector
Dataframe
Dplyr
Pandas merge two dataframes with different columns
Python
Pandas
Dataframe
Data Munging
Converting two columns of a data frame to a named vector
R
Vector
Dataframe
Coercion
dplyr change many data types
R
Dataframe
Dplyr
How to Add Incremental Numbers to a New Column Using Pandas
Python
Pandas
Dataframe
What is the difference between using square brackets or dot to access a column?
Python
Pandas
Dataframe
Indexing
Pandas Split Dataframe into two Dataframes at a specific row
Python
Pandas
Numpy
Dataframe
How to split a data frame?
R
Split
Dataframe
R Faq
Subset / filter rows in a data frame based on a condition in a column
R
Dataframe
Subset
R Faq
Undefined columns selected when subsetting data frame
R
Dataframe
Subset
Replace all occurrences of a string in a data frame
R
Dataframe
How to pivot Spark DataFrame?
Dataframe
Apache Spark
Pyspark
Apache Spark-Sql
Pivot
Coalesce values from 2 columns into a single column in a pandas dataframe
Python
Pandas
Numpy
Dataframe
Pyspark: Split multiple array columns into rows
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
Replacing character values with NA in a data frame
R
Dataframe
Na
R: losing column names when adding rows to an empty data frame
R
Dataframe
Names
Rbind
Convert DataFrame column type from string to datetime
Python
Pandas
Dataframe
Datetime Format
Python Datetime
Conditional replacement of values in a data.frame
R
Dataframe
How to print (to paper) a nicely-formatted data frame
R
Dataframe
Formatting
Counting non zero values in each column of a DataFrame in python
Python
Pandas
Dataframe
Pandas Groupby
How to convert an XML file to nice pandas dataframe?
Python
Xml
Pandas
Dataframe
Parsing
How do I find the closest values in a Pandas series to an input number?
Python
Pandas
Dataframe
Ranking
Pandas - Filtering None Values
Python
Pandas
Dataframe
How to reversibly store and load a Pandas dataframe to/from disk
Python
Pandas
Dataframe
Sample random rows in dataframe
R
Dataframe
Random
R Faq
What is difference between dataframe and list in R?
R
List
Dataframe
Re-ordering factor levels in data frame
R
Dataframe
Levels
R Faq
How to check if two data frames are equal
Database
R
Dataset
Compare
Dataframe
Adding a column that's the result of the difference in consecutive rows in pandas
Pandas
Dataframe
Series
pandas apply function that returns multiple values to rows in pandas dataframe
Python
Pandas
Dataframe
Apply
Iterable Unpacking
How to concatenate multiple column values into a single column in Pandas dataframe
Python
Pandas
Dataframe
Python: create a pandas data frame from a list
List
Python 3.x
Pandas
Dataframe
What you can do with a data.frame that you can't with a data.table?
R
Dataframe
data.table
Merge two data frames while keeping the original row order
R
Sorting
Dataframe
Merge
Convert pandas DataFrame into list of lists
Python
Pandas
Dataframe
Pandas column bind (cbind) two data frames
Python
Pandas
Dataframe
pandas - change df.index from float64 to unicode or string
Python
Pandas
Indexing
Dataframe
Rows
Removing space from columns in pandas
Python
Pandas
Dataframe
pandas: filter rows of DataFrame with operator chaining
Python
Pandas
Dataframe
Merge Two Lists in R
R
List
Dataframe
Find indices of duplicated rows
R
Duplicates
Dataframe
How to convert OpenDocument spreadsheets to a pandas DataFrame?
Python
Pandas
Libreoffice
Dataframe
Opendocument
Nested dictionary to multiindex dataframe where dictionary keys are column labels
Python
Dictionary
Pandas
Dataframe
Multi Index
Create Spark DataFrame. Can not infer schema for type: <type 'float'>
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
Set value to an entire column of a pandas dataframe
Python
Pandas
Dataframe
How to make good reproducible Apache Spark examples
Dataframe
Apache Spark
Pyspark
Apache Spark-Sql
How to add pandas data to an existing csv file?
Python
Pandas
Csv
Dataframe
How to slice a Pandas Data Frame by position?
Python
Pandas
Dataframe
Slice
python pandas extract year from datetime: df['year'] = df['date'].year is not working
Python
Datetime
Pandas
Extract
Dataframe
pandas get the row-wise minimum value of two or more columns
Python
Pandas
Dataframe
pandas dataframe convert column type to string or categorical
Pandas
Dataframe
Type Conversion
Categorical Data
Aggregation in Pandas
Python
Pandas
Dataframe
Pandas Groupby
Aggregation
How to get row index number in R?
R
Dataframe
How do I extract a single column from a data.frame as a data.frame?
R
Dataframe
Subset
Querying Spark SQL DataFrame with complex types
Sql
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Row-wise average for a subset of columns with missing values
Python
Pandas
Dataframe
How to loop through each row of dataFrame in pyspark
Apache Spark
Dataframe
For Loop
Pyspark
Apache Spark-Sql
Find empty or NaN entry in Pandas Dataframe
List
Python 2.7
Pandas
Indexing
Dataframe
How to get number of groups in a groupby object in pandas?
Python
Pandas
Dataframe
Group By
Pandas Groupby
Indexing Pandas data frames: integer rows, named columns
Python
Pandas
Dataframe
Python/Pandas: counting the number of missing/NaN in each row
Pandas
Count
Row
Dataframe
Nan
Pandas apply but only for rows where a condition is met
Python
Pandas
Dataframe
Apply
How to simply add a column level to a pandas dataframe
Python
Pandas
Dataframe
Multi Level
Cartesian product data frame
R
Dataframe
Delete rows with blank values in one particular column
R
Dataframe
Missing Data
Normalize columns of pandas data frame
Python
Pandas
Dataframe
Normalize
How to remove levels from a multi-indexed dataframe?
Pandas
Dataframe
Multi Index
Finding non-numeric rows in dataframe in pandas?
Python
Pandas
Dataframe
Convert Pandas DataFrame to JSON format
Json
Pandas
Dataframe
Pandas dataframe groupby plot
Python
Pandas
Matplotlib
Dataframe
Pandas get frequency of item occurrences in a column as percentage
Python
Pandas
Dataframe
Percentage
Remove pandas rows with duplicate indices
Python
Pandas
Dataframe
Duplicates
Convert data.frame columns from factors to characters
R
Dataframe
Merge unequal dataframes and replace missing rows with 0
R
Merge
Dataframe
Why am I getting X. in my column names when reading a data frame?
R
Dataframe
read.csv
Illegal Characters
R pass variable column indices to ggplot2
R
Ggplot2
Dataframe
Merge multiple column values into one column in python pandas
Python
List
Pandas
Row
Dataframe
Dynamically evaluate an expression from a formula in Pandas
Python
Pandas
Dataframe
Formula
Eval
What is the most efficient way to loop through dataframes with pandas?
Python
Pandas
Performance
Dataframe
For Loop
Add an empty column to Spark DataFrame
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
Provide schema while reading csv file as a dataframe
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Spark Csv
Trouble passing in lambda to apply for pandas DataFrame
Python
Pandas
Dataframe
Lambda
Apply
Transform a Counter object into a Pandas DataFrame
Python
Pandas
Dataframe
Counter
Python / Pandas - GUI for viewing a DataFrame or Matrix
Python
User Interface
Pandas
Dataframe
Add a prefix to column names
R
Dataframe
How to sort pandas data frame using values from several columns?
Python
Sorting
Dataframe
Pandas
Testing if a pandas DataFrame exists
Python
Pandas
Dataframe
How do I get the row count of a Pandas DataFrame?
Python
Pandas
Dataframe
Selecting multiple columns in a Pandas dataframe
Python
Pandas
Dataframe
Select
How does one reorder columns in a data frame?
R
Sorting
Dataframe
R Faq
Calculate row means on subset of columns
R
Dataframe
Calculate time difference between Pandas Dataframe indices
Python
Dataframe
Pandas
Cumulative sum and percentage on column?
Python
Pandas
Dataframe
Cumulative Sum
Select only one index of multiindex DataFrame
Python
Pandas
Select
Dataframe
Indexing
What does axis in pandas mean?
Python
Pandas
Numpy
Dataframe
create pandas dataframe from dictionary of dictionaries
Dictionary
Pandas
Dataframe
Checking if particular value (in cell) is NaN in pandas DataFrame not working using ix or iloc
Python
Pandas
Dataframe
Nan
Convert Pandas dataframe to PyTorch tensor?
Python
Pandas
Dataframe
Pytorch
Unseen factor levels when appending new records with unseen string values to a dataframe, cause Warning and result in NA
R
Dataframe
Append
R Factor
Pandas index column title or name
Python
Pandas
Dataframe
Columnname
List all column except for one in R
R
Dataframe
How do you Unit Test Python DataFrames
Unit Testing
Pandas
Numpy
Dataframe
Combining two Series into a DataFrame in pandas
Python
Pandas
Series
Dataframe
How to sort a data frame by date
R
Sorting
Date
Dataframe
Remove Rows From Data Frame where a Row matches a String
R
Dataframe
Remove NaN/NULL columns in a Pandas dataframe?
Python
Pandas
Dataframe
Nan
Find all columns of dataframe in Pandas whose type is float, or a particular type?
Python
Pandas
Dataframe
Data Cleaning
How do I get the name of the rows from the index of a data frame?
Python
Pandas
Dataframe
How to calculate mean values grouped on another column in Pandas
Python
Pandas
Dataframe
Count frequency of values in pandas DataFrame column
Python
Django
Pandas
Dataframe
pandas concat generates nan values
Python
Pandas
Dataframe
Concatenation
Nan
How to drop columns by name in a data frame
R
Dataframe
Subset
Pandas column access w/column names containing spaces
Python
Pandas
String
Dataframe
SQL-like window functions in PANDAS: Row Numbering in Python Pandas Dataframe
Python
Pandas
Numpy
Dataframe
calculate the mean for each column of a matrix in R
R
Dataframe
Mean
How to repeat a Pandas DataFrame?
Python
Pandas
Duplicates
Dataframe
Repeat
python pandas flatten a dataframe to a list
Python
List
Numpy
Pandas
Dataframe
Diff of two Dataframes
Python
Pandas
Dataframe
Diff
How to join two dataframes for which column values are within a certain range?
Python
Pandas
Datetime
Dataframe
Intervals
How to convert column with string type to int form in pyspark data frame?
Python
Dataframe
Apache Spark
Pyspark
Apache Spark-Sql
How to fix spaces in column names of a data.frame (remove spaces, inject dots)?
R
Dataframe
Sort pandas dataframe both on values of a column and index?
Python
Pandas
Sorting
Dataframe
dplyr: nonstandard column names (white space, punctuation, starts with numbers)
R
Dataframe
Dplyr
Running get_dummies on several DataFrame columns?
Python
Pandas
Dataframe
One Hot-Encoding
How to query JSON data column using Spark DataFrames?
Scala
Apache Spark
Dataframe
Apache Spark-Sql
Spark Cassandra-Connector
Performant cartesian product (CROSS JOIN) with pandas
Python
Pandas
Numpy
Dataframe
Merge
pandas get rows which are NOT in other dataframe
Python
Pandas
Dataframe
Get column index from column name in python pandas
Python
Pandas
Dataframe
Indexing
Merge data frames based on rownames in R
R
Merge
Dataframe
Rearrange dataframe to a table, the opposite of "melt"
R
Dataframe
Reshape
Split data.frame based on levels of a factor into new data.frames
R
Dataframe
R Faq
Replacing values from a column using a condition in R
R
Dataframe
Conditional Statements
How to convert a list consisting of vectors of different lengths to a usable data frame in R?
R
Vector
Dataframe
Filtering all rows with NaT in a column in Dataframe python
Python
Pandas
Dataframe
Pandas sum across columns and divide each cell from that value
Python
Pandas
Dataframe
Get weekday/day-of-week for Datetime column of DataFrame
Python
Pandas
Dataframe
Datetime
Dayofweek
How do I filter a pandas DataFrame based on value counts?
Python
Pandas
Filtering
Dataframe
How do I Pandas group-by to get sum?
Python
Pandas
Dataframe
Group By
Aggregate
How to access the last value in a vector?
R
Dataframe
Vector
Anti-Join Pandas
Python
Pandas
Dataframe
Merge
Anti Join
Add row to a data frame with total sum for each column
R
Dataframe
Sort (order) data frame rows by multiple columns
R
Sorting
Dataframe
R Faq
Pandas (python): How to add column to dataframe for index?
Python
Indexing
Dataframe
Pandas
Export a LaTeX table from pandas DataFrame
Python
Latex
Dataframe
Pandas
Spark: subtract two DataFrames
Apache Spark
Dataframe
Rdd
Is it possible to append Series to rows of DataFrame without making a list first?
Python
Pandas
Machine Learning
Dataframe
Series
Pythonic/efficient way to strip whitespace from every Pandas Data frame cell that has a stringlike object in it
Python
Pandas
Dataframe
Convert float64 column to int64 in Pandas
Python
Pandas
Dataframe
How to read a list of parquet files from S3 as a pandas dataframe using pyarrow?
Python
Pandas
Dataframe
Boto3
Pyarrow
Check for duplicate values in Pandas dataframe column
Python
Pandas
Dataframe
Duplicates
Opposite of %in%: exclude rows with values specified in a vector
R
Dataframe
Subset
Pandas sort by group aggregate and column
Python
Sorting
Group By
Dataframe
Pandas
How to save a data frame as CSV to a user selected location using tcltk
R
Csv
Save
Dataframe
How to add a new column to a dataframe (to the front, not the end)?
R
Dataframe
Remove last N rows in data frame with the arbitrary number of rows
R
Dataframe
Row
Replace all occurrences of a string in a pandas dataframe (Python)
Python
Replace
Pandas
Dataframe
Python Pandas Group by date using datetime data
Python
Pandas
Datetime
Dataframe
Pandas Groupby
Rolling Mean on pandas on a specific column
Python
Python 3.x
Pandas
Dataframe
Conditionally fill column values based on another columns value in pandas
Python
Python 3.x
Pandas
Dataframe
Numpy
set difference for pandas
Python
Pandas
Dataframe
Pandas Replace NaN with blank/empty string
Python
Pandas
Dataframe
Nan
Select DataFrame rows between two dates
Python
Pandas
Dataframe
mutate_each / summarise_each in dplyr: how do I select certain columns and give new names to mutated columns?
R
Dataframe
Dplyr
How to remove a pandas dataframe from another dataframe
Python
Pandas
Dataframe
Subtraction
Retrieve top n in each group of a DataFrame in pyspark
Python
Apache Spark
Dataframe
Pyspark
Apache Spark-Sql
Pandas DataFrame Add column to index without resetting
Python
Dataframe
Pandas
pandas comparison raises TypeError: cannot compare a dtyped [float64] array with a scalar of type [bool]
Python
Pandas
Typeerror
Dataframe
How to convert dataframe to dictionary in pandas WITHOUT index
Python
Pandas
Dictionary
Dataframe
Why is it not advisable to use attach() in R, and what should I use instead?
R
Dataframe
R Faq
Why is plyr so slow?
R
Dataframe
Plyr
data.table
write.csv for large data.table
R
File Io
Csv
Dataframe
data.table
return max value from pandas dataframe as a whole, not based on column or rows
Python
Pandas
Max
Dataframe
count number of rows in a data frame in R based on group
R
Dataframe
Rowcount
Pandas: join DataFrames on field with different names?
Python
Pandas
Join
Dataframe
Field
dplyr: order columns alphabetically in R
R
Dataframe
Dplyr
In Python pandas, start row index from 1 instead of zero without creating additional column
Python
Pandas
Indexing
Dataframe
Numpy "where" with multiple conditions
Python
Pandas
Numpy
Dataframe
Is there a simple way to change a column of yes/no to 1/0 in a Pandas dataframe?
Python
Pandas
Dataframe
Series
How to replace all Null values of a dataframe in Pyspark
Dataframe
Null
Pyspark
Convert a Pandas DataFrame to a dictionary
Python
Pandas
Dictionary
Dataframe
Compare if two dataframe objects in R are equal?
R
Dataframe
Compare
Equality
Using .loc with a MultiIndex in pandas
Python
Pandas
Dataframe
Multi Index