java自动根据文件内容的编码来读取避免乱码

yaerfeng1989

浏览: 225418 次
性别:
来自: 北京

最近访客更多访客>>

yunzhu

foxinmy

lixiaoxin

xubbsun

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

2014-07 ( 26)
2014-06 ( 28)
2014-05 ( 30)
更多存档...

博客分类：

乱码
java

乱码编码自动 cpdetector

通过cpdetector这个开源的jar包可以自动判断当前文件的内容编码，从而在读取的时候选择正确的编码读取，避免乱码问题。

原创不易，转载请注明出处:java自动根据文件内容的编码来读取避免乱码

测试结果,提供截图:

GBK文件内容

UTF8文件内容

运行结果:

package com.zuidaima.test;

import info.monitorenter.cpdetector.io.ASCIIDetector;
import info.monitorenter.cpdetector.io.CodepageDetectorProxy;
import info.monitorenter.cpdetector.io.JChardetFacade;
import info.monitorenter.cpdetector.io.ParsingDetector;
import info.monitorenter.cpdetector.io.UnicodeDetector;

import java.io.BufferedReader;
import java.io.File;
import java.io.FileInputStream;
import java.io.InputStreamReader;

public class Main {

	public static String getContent(String path) throws Exception {
		File file = new File(path);
		CodepageDetectorProxy detector = CodepageDetectorProxy.getInstance();
		detector.add(new ParsingDetector(false));
		detector.add(JChardetFacade.getInstance());
		detector.add(ASCIIDetector.getInstance());
		detector.add(UnicodeDetector.getInstance());
		java.nio.charset.Charset charset = null;
		try {
			charset = detector.detectCodepage(file.toURI().toURL());
		} catch (Exception ex) {
			ex.printStackTrace();
		}
		String charsetName = null;
		if (charset != null) {
			charsetName = charset.name();
		} else {
			charsetName = "UTF-8";
		}
		BufferedReader reader = new BufferedReader(new InputStreamReader(
				new FileInputStream(file), charsetName));
		String line = null;
		String lines = "";
		while ((line = reader.readLine()) != null) {
			lines += line + "\n";
		}
		reader.close();
		return lines;
	}

	public static void main(String[] args) throws Exception {
		System.out.println(getContent("bin/gbk.txt"));
		System.out.println(getContent("bin/utf8.txt"));
	}
}

代码下载地址:http://www.zuidaima.com/share/1550463235574784.htm

0
顶

3
踩

分享到：

分享java读写Properties文件 | java循环某年某月的所有天数

2014-03-10 09:54
浏览 1043
评论(1)
分类:编程语言
查看更多

1 楼 ray_linn 2014-03-10

Text文件头是可省略的，所有检测都是不靠谱的，只是猜测而已。

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

java自动根据文件内容的编码来读取避免乱码

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

java自动根据文件内容的编码来读取避免乱码

评论

发表评论

相关推荐

java H2数据库使用并实现增删改查功能

java定时任务类Timer和TimerTask用法详解

java swing调用webservice实现qq在线查询是否在线

java汉字转换为拼音

Java 扑克发牌算法实现

java基数排序算法代码下载

java桶式排序算法代码下载

java rmi服务器端客户端传输数据实例教程

java冒泡排序Bubble Sort算法代码

分享java的Serializable功能

java查看windows的磁盘空间大小信息

java文件操作之移动文件到指定的目录

java修改文件为只读权限

java运行bat命令得到某个windows文件的创建时间

java文件操作之FileWriter用法，向文件尾插入内容

java socket控制台版本聊天室程序源码下载

通过codehaus来实现json写入文件和读取文件成json对象

springmvc如何将form表单中的对象类型绑定

java qr二维条码生成器

java calendar循环某年某月的所有天数

最近访客更多访客>>