public static List<String> run(String text) {
String dateStr = text.replaceAll("r?n", " ");
List<String> dates = new ArrayList<String>();
try {
Pattern p = Pattern.compile("(\\d{1,4}[-|\\/|年|\\.|\\s]?(0[1-9]|1[1-2])[-|\\/|月|\\.|\\s]?(0[1-9]|[1-2]\\d|3[0-1]))", Pattern.CASE_INSENSITIVE|Pattern.MULTILINE);
Matcher matcher = p.matcher(dateStr);
while (matcher.find()) {
String temp = matcher.group();
dates.add(temp);
}
} catch (Exception e) {
}
return dates;
}
只能提取同时具有年月日的日期,但可以匹配多种日期的连接方式,可以根据需要进行更改。提取之后可以通过以下方法将时间字符串转换成标准格式。代码如下
public static List<String> getDates(List<String> list){
List<String> dates = new ArrayList<String>();
for (String date : list) {
String[] strs = date.split("[-|\\/|年|月|\\.|\\s]");
String time = "";
for (String s : strs) {
time += s;
}
dates.add(time);
}
return dates;
}