pythonrsplit函數_python 獲取文件後綴名

⑴ python 獲取文件後綴名

程序代碼如下所示：

importos
dict={}
ford,fd,flinos.walk(r"F:\"):
forfinfl:
sufix=os.path.splitext(f)[1][1:]
ifdict.has_key(sufix):
dict[sufix]+=1
else:
dict[sufix]=1

foritemindict.items():
print"%s:%s"%item

第二行：創建一個字典用來保存文件後綴名及個數；
第三行：循環的目的主要就是os.path.splitext()分離後綴名和文件名；
最後列印輸出。

⑵ python中的strip和split結合起來怎麼用

python strip() 函數和 split() 函數的詳解及實例
一直以來都分不清楚strip和split的功能，實際上strip是刪除的意思；而split則是分割的意思。因此也表示了這兩個功能是完全不一樣的，strip可以刪除字元串的某些字元，而split則是根據規定的字元將字元串進行分割。下面就詳細說一下這兩個功能，
1 Python strip()函數介紹
函數原型
聲明：s為字元串，rm為要刪除的字元序列
s.strip(rm) 刪除s字元串中開頭、結尾處，位於 rm刪除序列的字元
s.lstrip(rm) 刪除s字元串中開頭處，位於 rm刪除序列的字元
s.rstrip(rm) 刪除s字元串中結尾處，位於 rm刪除序列的字元
注意：
（1）當rm為空時，默認刪除空白符（包括'\n', '\r', '\t', ' ')
（2）這里的rm刪除序列是只要邊（開頭或結尾）上的字元在刪除序列內，就刪除掉。
例如，

>>> a = ' 123'
>>> a
' 123'
>>> a.strip()
'123'

（2）這里的rm刪除序列是只要邊（開頭或結尾）上的字元在刪除序列內，就刪除掉。

例如，

>>> a = '123abc'
>>> a.strip('21')
'3abc'
>>> a.strip('12')
'3abc'

結果是一樣的。

2 python split()函數介紹

說明：

Python中沒有字元類型的說法，只有字元串，這里所說的字元就是只包含一個字元的字元串！！！

這里這樣寫的原因只是為了方便理解，僅此而已。
（1）按某一個字元分割，如『.'
>>> str = ('www.google.com')
>>> print str
www.google.com
>>> str_split = str.split('.')
>>> print str_split
['www', 'google', 'com']

（2）按某一個字元分割，且分割n次。如按『.'分割1次
>>> str_split = str.split('.',1)
>>> print str_split
['www', 'google.com']

（3）split()函數後面還可以加正則表達式，例如：
>>> str_split = str.split('.')[0]
>>> print str_split
www

split分隔後是一個列表，[0]表示取其第一個元素；

>>> str_split = str.split('.')[::-1]
>>> print str_split
['com', 'google', 'www']
>>> str_split = str.split('.')[::]
>>> print str_split
['www', 'google', 'com']

按反序列排列，[::]安正序排列
>>> str = str + '.com.cn'
>>> str
'www.google.com.com.cn'
>>> str_split = str.split('.')[::-1]
>>> print str_split
['cn', 'com', 'com', 'google', 'www']
>>> str_split = str.split('.')[:-1]
>>> print str_split
['www', 'google', 'com', 'com']

從首個元素開始到次末尾，最後一個元素刪除掉。
split()函數典型應用之一，ip數字互換：
# ip ==> 數字
>>> ip2num = lambda x:sum([256**j*int(i) for j,i in enumerate(x.split('.')[::-1])])
>>> ip2num('192.168.0.1')
3232235521

# 數字 ==> ip # 數字范圍[0, 255^4]
>>> num2ip = lambda x: '.'.join([str(x/(256**i)%256) for i in range(3,-1,-1)])
>>> num2ip(3232235521)
'192.168.0.1'

最後，python怎樣將一個整數與IP地址相互轉換？
>>> import socket
>>> import struct
>>> int_ip = 123456789
>>> socket.inet_ntoa(struct.pack(『I',socket.htonl(int_ip)))#整數轉換為ip地址
『7.91.205.21'
>>> str(socket.ntohl(struct.unpack(「I」,socket.inet_aton(「255.255.255.255″))[0]))#ip地址轉換為整數
『4294967295'

⑶ python 字元串分割split()函數中中英文逗號分割

在我這里沒有問題啊，能夠正確地分割。注意最好不要用內置函數名稱作為變數名。

⑷ Python里使用split("\r\n")分割html字元串，報錯TypeError:a bytes-like object is required,not 'str'

read()後加.decode('utf8')

⑸ Python字元串split及rsplit方法原理詳解

1.描述

split()方法通過指定分隔符對字元串進行切片，如果參數num有指定值，則分隔num+1個子字元串，默認分隔符為所有空字元，包括空格、換行(\n)、製表符(\t)等

rstrip()方法通過

2.語法

str.split([sep=None][,count=S.count(sep)])

str.rsplit([sep=None][,count=S.count(sep)])

3.參數

sep -- 可選參數，指定的分隔符，默認為所有的空字元，包括空格、換行(\n)、製表符(\t)等

count -- 可選參數，分割次數，默認為分隔符在字元串中出現的總次數

4.返回值

返回分割後的字元串列表，可以用新字元串來接收

5.實例

str1 = "Hao123 hao456 hAo789"

new_str = str1.split()

new_str2 = str1.split(' ', 1)

new_str3 = str1.rsplit(' ', 1)

print(new_str)

print(new_str2)

print(new_str3)

#輸出結果如下：

['Hao123', 'hao456', 'hAo789']

['Hao123', 'hao456 hAo789']

['Hao123 hao456', 'hAo789']

以上就是本文的全部內容，希望對大家的學習有所幫助，也希望大家多多支持。

⑹ python中split的用法問題

1、首先雙擊打開pycharm編輯工具之後，新建python文件split.py，如下圖所示。

⑺ Python常用的正則表達式處理函數詳解

正則表達式是一個特殊的字元序列，用於簡潔表達一組字元串特徵，檢查一個字元串是否與某種模式匹配，使用起來十分方便。

在Python中，我們通過調用re庫來使用re模塊：

import re

下面介紹Python常用的正則表達式處理函數。

re.match函數

re.match 函數從字元串的起始位置匹配正則表達式，返回match對象，如果不是起始位置匹配成功的話，match()就返回None。

re.match(pattern, string, flags=0)

pattern：匹配的正則表達式。

string：待匹配的字元串。

flags：標志位，用於控制正則表達式的匹配方式，如：是否區分大小寫，多行匹配等等。具體參數為：

re.I：忽略大小寫。

re.L：表示特殊字元集 w, W, , B, s, S 依賴於當前環境。

re.M：多行模式。

re.S：即 . ，並且包括換行符在內的任意字元（. 不包括換行符）。

re.U：表示特殊字元集 w, W, , B, d, D, s, S 依賴於 Unicode 字元屬性資料庫。

re.X：為了增加可讀性，忽略空格和 # 後面的注釋。

import re #從起始位置匹配 r1=re.match('abc','abcdefghi') print(r1) #不從起始位置匹配 r2=re.match('def','abcdefghi') print(r2)

運行結果：

其中，span表示匹配成功的整個子串的索引。

使用group(num) 或 groups() 匹配對象函數來獲取匹配表達式。

group(num)：匹配的整個表達式的字元串，group() 可以一次輸入多個組號，這時它將返回一個包含那些組所對應值的元組。

groups()：返回一個包含所有小組字元串的元組，從 1 到所含的小組號。

import re s='This is a demo' r1=re.match(r'(.*) is (.*)',s) r2=re.match(r'(.*) is (.*?)',s) print(r1.group()) print(r1.group(1)) print(r1.group(2)) print(r1.groups()) print() print(r2.group()) print(r2.group(1)) print(r2.group(2)) print(r2.groups())

運行結果：

上述代碼中的(.*)和(.*?)表示正則表達式的貪婪匹配與非貪婪匹配。

re.search函數

re.search函數掃描整個字元串並返回第一個成功的匹配，如果匹配成功則返回match對象，否則返回None。

re.search(pattern, string, flags=0)

pattern：匹配的正則表達式。

string：待匹配的字元串。

flags：標志位，用於控制正則表達式的匹配方式，如：是否區分大小寫，多行匹配等等。

import re #從起始位置匹配 r1=re.search('abc','abcdefghi') print(r1) #不從起始位置匹配 r2=re.search('def','abcdefghi') print(r2)

運行結果：

使用group(num) 或 groups() 匹配對象函數來獲取匹配表達式。

group(num=0)：匹配的整個表達式的字元串，group() 可以一次輸入多個組號，這時它將返回一個包含那些組所對應值的元組。

groups()：返回一個包含所有小組字元串的元組，從 1 到所含的小組號。

import re s='This is a demo' r1=re.search(r'(.*) is (.*)',s) r2=re.search(r'(.*) is (.*?)',s) print(r1.group()) print(r1.group(1)) print(r1.group(2)) print(r1.groups()) print() print(r2.group()) print(r2.group(1)) print(r2.group(2)) print(r2.groups())

運行結果：

從上面不難發現re.match與re.search的區別：re.match只匹配字元串的起始位置，只要起始位置不符合正則表達式就匹配失敗，而re.search是匹配整個字元串，直到找到一個匹配為止。

re.compile 函數

compile 函數用於編譯正則表達式，生成一個正則表達式對象，供 match() 和 search() 這兩個函數使用。

re.compile(pattern[, flags])

pattern：一個字元串形式的正則表達式。

flags：可選，表示匹配模式，比如忽略大小寫，多行模式等。

import re #匹配數字 r=re.compile(r'd+') r1=r.match('This is a demo') r2=r.match('This is 111 and That is 222',0,27) r3=r.match('This is 111 and That is 222',8,27) print(r1) print(r2) print(r3)

運行結果：

findall函數

搜索字元串，以列表形式返回正則表達式匹配的所有子串，如果沒有找到匹配的，則返回空列表。

需要注意的是，match 和 search 是匹配一次，而findall 匹配所有。

findall(string[, pos[, endpos]])

string：待匹配的字元串。

pos：可選參數，指定字元串的起始位置，默認為0。

endpos：可選參數，指定字元串的結束位置，默認為字元串的長度。

import re #匹配數字 r=re.compile(r'd+') r1=r.findall('This is a demo') r2=r.findall('This is 111 and That is 222',0,11) r3=r.findall('This is 111 and That is 222',0,27) print(r1) print(r2) print(r3)

運行結果：

re.finditer函數

和 findall 類似，在字元串中找到正則表達式所匹配的所有子串，並把它們作為一個迭代器返回。

re.finditer(pattern, string, flags=0)

pattern：匹配的正則表達式。

string：待匹配的字元串。

flags：標志位，用於控制正則表達式的匹配方式，如是否區分大小寫，多行匹配等。

import re r=re.finditer(r'd+','This is 111 and That is 222') for i in r: print (i.group())

運行結果：

re.split函數

將一個字元串按照正則表達式匹配的子串進行分割後，以列表形式返回。

re.split(pattern, string[, maxsplit=0, flags=0])

pattern：匹配的正則表達式。

string：待匹配的字元串。

maxsplit：分割次數，maxsplit=1分割一次，默認為0，不限次數。

flags：標志位，用於控制正則表達式的匹配方式，如：是否區分大小寫，多行匹配等。

import re r1=re.split('W+','This is 111 and That is 222') r2=re.split('W+','This is 111 and That is 222',maxsplit=1) r3=re.split('d+','This is 111 and That is 222') r4=re.split('d+','This is 111 and That is 222',maxsplit=1) print(r1) print(r2) print(r3) print(r4)

運行結果：

re.sub函數

re.sub函數用於替換字元串中的匹配項。

re.sub(pattern, repl, string, count=0, flags=0)

pattern：正則中的模式字元串。

repl：替換的字元串，也可為一個函數。

string：要被查找替換的原始字元串。

count：模式匹配後替換的最大次數，默認0表示替換所有的匹配。

import re r='This is 111 and That is 222' # 刪除字元串中的數字 r1=re.sub(r'd+','',r) print(r1) # 刪除非數字的字元串 r2=re.sub(r'D','',r) print(r2)

運行結果：

到此這篇關於Python常用的正則表達式處理函數詳解的文章就介紹到這了，希望大家以後多多支持！

導航:首頁 > 編程語言 > pythonrsplit函數

pythonrsplit函數

與pythonrsplit函數相關的資料