python批量統計excel數據_利用python對excel計數並輸出結果

Ⅰ pandas如何統計excel中列數據的行數

准備測試數據：

運行效果展示

附上實現代碼：

#!/usr/bin/env python

import pandas as pd

OPENPATH = 'test.xls'
SAVEPATH = 'test1.xls'

def total_count(path=OPENPATH, sheetname='testsheet'):
df = pd.read_excel(path, sheet_name=sheetname, names=['值', '計數'])
# 獲取統計項目
item_name = set(df['值'])
# 創建字典統計
total_dict = dict(zip([i for i in item_name], [
0 for _ in range(len(item_name))]))
# 遍歷」值「列，逐個統計數量
for index, item in enumerate(df['值']):
# 如果在set中
if item in item_name:
# 加入計數統計
total_dict[item] += df['計數'][index]
# 返回
return total_dict

def datato_excel(path=SAVEPATH, sheet_name='total', data_dict={}):
report_df = pd.DataFrame.from_dict(data_dict,orient='index')
xl_writer = pd.ExcelWriter(path)
report_df.to_excel(xl_writer, sheet_name)
try:
xl_writer.save()
print('Save completed')
except:
print('Error in saving file')

if __name__ == "__main__":
datato_excel(data_dict=total_count())

不知道這是不是您想要的結果，如果有幫助，請採納一下，謝謝！

Ⅱ python如何處理大量excel數據

使用pip install openpyxl即可，但是在windows下安裝的是2.2.6版本，但是centos自動安裝的是4.1版本。

from xlwt import Workbook, Formula

import xlrd

book = Workbook()

sheet1 = book.add_sheet('Sheet 1')

sheet1.write(0,0,10)

sheet1.write(1,0,Formula('A1/B1'))

sheet2 = book.add_sheet('Sheet 2')

row = sheet2.row(0)

row.write(2,Formula("$A$1+$B$1*SUM('ShEEt 1'!$A$1:$b$2)"))

book = xlrd.open_workbook('formula.xls')

sheet = book.sheets()[0]

for i in range(nrows):

print (sheet.cell(i,j).value)

Python

是完全面向對象的語言。函數、模塊、數字、字元串都是對象。並且完全支持繼承、重載、派生、多繼承，有益於增強源代碼的復用性。Python支持重載運算符和動態類型。相對於Lisp這種傳統的函數式編程語言，Python對函數式設計只提供了有限的支持。有兩個標准庫(functools, itertools)提供了Haskell和Standard ML中久經考驗的函數式程序設計工具。

Ⅲ python批處理多個excel文件，得出所有excel文件中某一列的和，具體程序如下：文件都在D:/excel中

import os,xlrd
numh=0.0
for f in os.listdir('D:\excel'):
    print "file:",f
    wb=xlrd.open_workbook(os.path.abspath(f))
    table=wb.sheet_by_index(1)
    for r in range(table.nrows):
        numh=table.cell(r,0).value+numh
    print 'numh:',numh

Ⅳ 如何使用python或者宏對excel中數據進行統計重復次數

在Python中pandas庫用於數據處理，我們從1787頁的pandas官網文檔中總結出最常用的36個函數，通過這些函數介紹如何通過Python完成數據生成和導入，數據清洗，預處理，以及最常見的數據分類，數據篩選，分類匯總，透視等最常見的操作。

Ⅳ 如何用python遍歷文件夾下的所有excel文件

大數據處理經常要用到一堆表格，然後需要把數據導入一個list中進行各種演算法分析，簡單講一下自己的做法：

1.如何讀取excel文件

網上的版本很多，在xlrd模塊基礎上，找到一些源碼：

[python]view plain

importxdrlib,sys
importxlrd
defopen_excel(file="C:/Users/flyminer/Desktop/新建MicrosoftExcel工作表.xlsx"):
data=xlrd.open_workbook(file)
returndata
#根據索引獲取Excel表格中的數據參數:file：Excel文件路徑colnameindex：表頭列名所在行的所以，by_index：表的索引
defexcel_table_byindex(file="C:/Users/flyminer/Desktop/新建MicrosoftExcel工作表.xlsx",colnameindex=0,by_index=0):
data=open_excel(file)
table=data.sheets()[by_index]
nrows=table.nrows#行數
ncols=table.ncols#列數
colnames=table.row_values(colnameindex)#某一行數據
list=[]
forrownuminrange(1,nrows):
row=table.row_values(rownum)
ifrow:
app={}
foriinrange(len(colnames)):
app[colnames[i]]=row[i]
list.append(app)
returnlist
#根據名稱獲取Excel表格中的數據參數:file：Excel文件路徑colnameindex：表頭列名所在行的所以，by_name：Sheet1名稱
defexcel_table_byname(file="C:/Users/flyminer/Desktop/新建MicrosoftExcel工作表.xlsx",colnameindex=0,by_name=u'Sheet1'):
data=open_excel(file)
table=data.sheet_by_name(by_name)
nrows=table.nrows#行數
colnames=table.row_values(colnameindex)#某一行數據
list=[]
forrownuminrange(1,nrows):
row=table.row_values(rownum)
ifrow:
app={}
foriinrange(len(colnames)):
app[colnames[i]]=row[i]
list.append(app)
returnlist
defmain():
tables=excel_table_byindex()
forrowintables:
print(row)
tables=excel_table_byname()
forrowintables:
print(row)
if__name__=="__main__":
main()
最後一句是重點，所以這里也給代碼人點個贊！

最後一句讓代碼里的函數都可以被復用，簡單地說：假設文件名是a，在程序中import a以後，就可以用a.excel_table_byname()和a.excel_table_byindex()這兩個超級好用的函數了。

2.然後是遍歷文件夾取得excel文件以及路徑：，原創代碼如下：

[python]view plain

importos
importxlrd
importtest_wy
xpath="E:/唐偉捷/電力/電力系統總文件夾/舟山電力"
xtype="xlsx"
typedata=[]
name=[]
raw_data=[]
file_path=[]
defcollect_xls(list_collect,type1):
#取得列表中所有的type文件
foreach_elementinlist_collect:
ifisinstance(each_element,list):
collect_xls(each_element,type1)
elifeach_element.endswith(type1):
typedata.insert(0,each_element)
returntypedata
#讀取所有文件夾中的xls文件
defread_xls(path,type2):
#遍歷路徑文件夾
forfileinos.walk(path):
foreach_listinfile[2]:
file_path=file[0]+"/"+each_list
#os.walk()函數返回三個參數：路徑，子文件夾，路徑下的文件，利用字元串拼接file[0]和file[2]得到文件的路徑
name.insert(0,file_path)
all_xls=collect_xls(name,type2)
#遍歷所有type文件路徑並讀取數據
forevey_nameinall_xls:
xls_data=xlrd.open_workbook(evey_name)
foreach_sheetinxls_data.sheets():
sheet_data=test_wy.excel_table_byname(evey_name,0,each_sheet.name)
#請參考讀取excel文件的代碼
raw_data.insert(0,sheet_data)
print(each_sheet.name,":Datahasbeendone.")
returnraw_data
a=read_xls(xpath,xtype)
print("Victory")
歡迎各種不一樣的想法~~

Ⅵ 利用python對excel計數，並輸出結果

通常是直接用命令行cat 文件名|grep -c "idea"就可以解決。
在python里也可以直接 open('文件名','rb').read().count('idea')這樣的方式取得數量
如果是復雜一些，可以用正則、
text=open('文件名','rb').read()
re.findall('(?isu)"username":"idea"', text)

導航:首頁 > 編程語言 > python批量統計excel數據

python批量統計excel數據

與python批量統計excel數據相關的資料