dataframe Archives - Page 25 of 36

Combining two Series into a DataFrame in pandas

August 16, 2022 by Magenaut

I have two Series s1 and s2 with the same (non-consecutive) indices. How do I combine s1 and s2 to being two columns in a DataFrame and keep one of the indices as a third column?

Retrieve top n in each group of a DataFrame in pyspark

August 16, 2022 by Magenaut

There’s a DataFrame in pyspark with data as below:

replace column values in one dataframe by values of another dataframe

August 16, 2022 by Magenaut

I have two dataframes, the first one has 1000 rows and looks like:

How to export a table dataframe in PySpark to csv?

August 16, 2022 by Magenaut

I am using Spark 1.3.1 (PySpark) and I have generated a table using a SQL query. I now have an object that is a DataFrame. I want to export this DataFrame object (I have called it “table”) to a csv file so I can manipulate it and plot the columns. How do I export the DataFrame “table” to a csv file?

pandas concat generates nan values

August 16, 2022 by Magenaut

I am curious why a simple concatenation of two data frames in pandas:

Select multiple ranges of columns in Pandas DataFrame

August 16, 2022 by Magenaut

I have to read several files some in Excel format and some in CSV format. Some of the files have hundreds of columns.

Pandas filling missing dates and values within group

August 16, 2022 by Magenaut

I’ve a data frame that looks like the following

Pandas selecting by label sometimes return Series, sometimes returns DataFrame

August 16, 2022 by Magenaut

In Pandas, when I select a label that only has one entry in the index I get back a Series, but when I select an entry that has more then one entry I get back a data frame.

Sort a pandas dataframe series by month name

August 16, 2022 by Magenaut

I have a Series object that has:

How do I filter a pandas DataFrame based on value counts?

August 16, 2022 by Magenaut

I’m working in Python with a pandas DataFrame of video games, each with a genre. I’m trying to remove any video game with a genre that appears less than some number of times in the DataFrame, but I have no clue how to go about this. I did find a StackOverflow question that seems to be related, but I can’t decipher the solution at all (possibly because I’ve never heard of R and my memory of functional programming is rusty at best).