Checking for English characters in Java: is a string Chinese or English?

❶ How to tell whether a char in Java is a Chinese or an English character

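Java file I/O offers two kinds of streams, character streams and byte streams, which work with the char and byte types respectively.
A character stream's read() returns one char at a time. Java represents text as Unicode (UTF-16), so a char always occupies 2 bytes in memory, whether it holds a Chinese or a Western character. English characters keep their ASCII values in Unicode, so the high byte of such a char is 0. Note that how many bytes of the file make up one char depends on the file's encoding; treating the data as fixed 2-byte pairs can go wrong, for example when the byte count is odd.
A byte stream reads 1 byte at a time and works for any input, especially binary files such as images. Its drawback is that it handles text files poorly, because a multi-byte character is split across several reads.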
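
To make the difference concrete, here is a minimal sketch that reads the same data once through a byte stream and once through a character stream. The class name StreamDemo, its helper methods, and the sample text are made up for illustration, and UTF-8 is assumed as the encoding of the data.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; all names here are illustrative only.
class StreamDemo {

    // Counts how many single-byte read() calls the byte stream needs.
    static int countBytes(byte[] data) {
        int n = 0;
        try (InputStream in = new ByteArrayInputStream(data)) {
            while (in.read() != -1) {
                n++;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return n;
    }

    // Counts how many chars the character stream yields.
    // Assumption: the bytes are UTF-8 encoded text.
    static int countChars(byte[] data) {
        int n = 0;
        try (Reader reader = new InputStreamReader(
                new ByteArrayInputStream(data), StandardCharsets.UTF_8)) {
            while (reader.read() != -1) {
                n++;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return n;
    }

    // True when the high byte of the 16-bit char is zero (value at most 0xFF).
    // This covers ASCII, but also Latin-1 letters such as U+00E9.
    static boolean highByteIsZero(char c) {
        return (c >> 8) == 0;
    }

    public static void main(String[] args) {
        // "Hi" followed by the two Chinese characters U+4E2D and U+6587.
        byte[] data = "Hi\u4e2d\u6587".getBytes(StandardCharsets.UTF_8);
        System.out.println("bytes read: " + countBytes(data));
        System.out.println("chars read: " + countChars(data));
        System.out.println("'H' high byte zero: " + highByteIsZero('H'));
        System.out.println("U+4E2D high byte zero: " + highByteIsZero('\u4e2d'));
    }
}
```

In UTF-8 each of the two Chinese characters takes 3 bytes, so the byte stream needs 8 reads while the character stream returns 4 chars. The high-byte test is only a rough filter, since it also accepts non-English Latin-1 characters.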

❷ Determining whether a Java string is Chinese or English

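There are two ways to tell whether a string is Chinese or English in Java:
One is to look at the encoded byte length: in ASCII-compatible encodings, English letters, digits, and punctuation each take a single byte, with values in the ASCII range 0-127.
The other is to test against the Unicode ranges used for Chinese. The check is only correct if every relevant range is included, so consult the Unicode ranges for Chinese characters.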
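
The byte-length approach only works relative to a specific encoding. The following sketch shows how the same character yields different lengths under different charsets; the class name ByteLengthDemo and its helpers are hypothetical and exist only for this illustration.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; the names are illustrative only.
class ByteLengthDemo {

    // Encoded length in UTF-8: 1 byte for ASCII, 3 bytes for most Chinese characters.
    static int utf8Length(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    // Encoded length in UTF-16BE (no byte order mark): 2 bytes per char, ASCII included.
    static int utf16Length(String s) {
        return s.getBytes(StandardCharsets.UTF_16BE).length;
    }

    // Encoded length in ISO-8859-1: 1 byte per char, including accented letters.
    static int latin1Length(String s) {
        return s.getBytes(StandardCharsets.ISO_8859_1).length;
    }

    public static void main(String[] args) {
        String ascii = "A";
        String chinese = "\u4e2d";   // the Chinese character U+4E2D
        String accented = "\u00e9";  // e with acute accent, neither English nor Chinese

        System.out.println("UTF-8    A: " + utf8Length(ascii));
        System.out.println("UTF-8    U+4E2D: " + utf8Length(chinese));
        System.out.println("UTF-16BE A: " + utf16Length(ascii));
        System.out.println("UTF-16BE U+4E2D: " + utf16Length(chinese));
        System.out.println("Latin-1  U+00E9: " + latin1Length(accented));
    }
}
```

Two things follow from this: a rule such as "1 byte means English" has to name the charset it relies on, and a single-byte result does not prove the character is English, because other single-byte charsets map non-English letters to one byte as well.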

Sample code:
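import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class StringTest {

    // In UTF-8 an English (ASCII) character takes 1 byte and a Chinese character takes 3,
    // so anything longer than 1 byte is treated as non-English (assumed to be Chinese).
    public static boolean checkChar(char ch) {
        // Name the charset explicitly; getBytes() without an argument uses the
        // platform default, which makes the result machine-dependent.
        return String.valueOf(ch).getBytes(StandardCharsets.UTF_8).length == 1;
    }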
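
    // Returns "Chinese", "English", or "" for a null or empty string.
    public static String checkString(String str) {
        String res = "";
        if (str != null) {
            for (int i = 0; i < str.length(); i++) {
                // A single Chinese character makes the whole string count as Chinese.
                if (!checkChar(str.charAt(i))) {
                    res = "Chinese";
                    break;
                } else {
                    res = "English";
                }
            }
        }
        return res;
    }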

    // Checks whether a char is Chinese, including Chinese punctuation and full-width forms.
    public static boolean isChinese(char c) {
        Character.UnicodeBlock ub = Character.UnicodeBlock.of(c);
        return ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
                || ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS
                || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A
                || ub == Character.UnicodeBlock.GENERAL_PUNCTUATION
                || ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION
                || ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS;
    }
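
    // Checks whether the string consists only of English letters.
    public static boolean isEnglish(String charaString) {
        // matches() already anchors at both ends; "+" rejects the empty string,
        // which the original "*" quantifier accepted.
        return charaString.matches("[a-zA-Z]+");
    }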

    // Checks by Unicode range. The range u4e00 ~ u9fa5 alone is incomplete,
    // so several CJK-related ranges are combined into one character class.
    public static String isChinese(String str) {
        String regEx = "["
                + "\\u4e00-\\u9fa5"   // CJK Unified Ideographs (common subset)
                + "\\uFF00-\\uFFEF"   // Halfwidth and Fullwidth Forms
                + "\\u2E80-\\u2EFF"   // CJK Radicals Supplement
                + "\\u3000-\\u303F"   // CJK Symbols and Punctuation
                + "\\u31C0-\\u31EF"   // CJK Strokes
                + "]";
        Pattern p = Pattern.compile(regEx);
        Matcher m = p.matcher(str);
        // One character from any of the ranges is enough to call the string Chinese.
        if (m.find()) {
            return "Chinese";
        } else {
            return "English";
        }
    }

    public static void main(String[] args) {
        System.out.println("Check by byte length:");
        System.out.println(checkString("Hello++"));
        System.out.println(checkString("Hello++。、，？"));
        System.out.println(checkString("Hello++編程"));
        System.out.println(checkString("編程"));

        System.out.println("\r\nCheck by regular expression:");
        System.out.println(isChinese("Hello++"));
        System.out.println(isChinese("Hello++。、，？"));
        System.out.println(isChinese("Hello++編程"));
        System.out.println(isChinese("編程"));

        System.out.println("\r\nCheck by Character.UnicodeBlock:");
        System.out.println(isChinese('h') ? "Chinese" : "English");
        System.out.println(isChinese(',') ? "Chinese" : "English");
        System.out.println(isChinese('。') ? "Chinese" : "English");
        System.out.println(isChinese('編') ? "Chinese" : "English");
    }
}
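
As an alternative to listing blocks or ranges by hand, the JDK's Character.UnicodeScript (available since Java 7) can classify code points by script. The sketch below is not the method used in the sample above; the class name HanScriptDemo and its helpers are hypothetical. It works on code points rather than chars, so ideographs outside the Basic Multilingual Plane are also recognized.

```java
// Hypothetical demo class; the names are illustrative only.
class HanScriptDemo {

    // True if at least one code point of the string belongs to the Han script.
    static boolean containsHan(String s) {
        return s.codePoints()
                .anyMatch(cp -> Character.UnicodeScript.of(cp) == Character.UnicodeScript.HAN);
    }

    // True if the string is non-empty and made up only of ASCII letters.
    static boolean isAsciiLetters(String s) {
        return !s.isEmpty()
                && s.chars().allMatch(c -> (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z'));
    }

    public static void main(String[] args) {
        System.out.println(containsHan("Hello++"));
        System.out.println(containsHan("Hello++\u7de8\u7a0b")); // ends with two Chinese characters
        System.out.println(containsHan("\u3002"));             // ideographic full stop only
        System.out.println(containsHan("\uD840\uDC00"));       // U+20000, a supplementary ideograph
        System.out.println(isAsciiLetters("Hello"));
        System.out.println(isAsciiLetters("Hello++"));
    }
}
```

One design difference to keep in mind: Chinese punctuation such as the ideographic full stop belongs to the Common script, not Han, so this sketch does not count punctuation as Chinese, whereas the block-based and range-based checks above do.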
