I'm trying to extract data from a text file with the following structure:
我正在尝试使用以下结构从文本文件中提取数据:
Employee: John C.
2013-01-01 10 $123
2013-01-02 12 $120
2013-01-03 8 $150
Employee: Michael G.
2013-01-01 5 $13
2013-01-05 11 $20
2013-01-10 2 $155
As you can see, the pattern is a table header containing the Employee name and then table content containing all of its transactions, then the pattern repeats.
如您所见,模式是一个包含Employee名称的表头,然后是包含其所有事务的表内容,然后重复模式。
To extract transactions I do this:
要提取交易,我这样做:
awk '/^ [A-Z]/{print $1"\t"$2"\t"$3}'
This gives this result:
这给出了这个结果:
2013-01-01 10 $123
2013-01-02 12 $120
2013-01-03 8 $150
2013-01-01 5 $13
2013-01-05 11 $20
2013-01-10 2 $155
How can I create a two pass extraction that returns this:
如何创建一个返回此的两遍提取:
2013-01-01 10 $123 John C.
2013-01-02 12 $120 John C.
2013-01-03 8 $150 John C.
2013-01-01 5 $13 Michael G.
2013-01-05 11 $20 Michael G.
2013-01-10 2 $155 Michael G.
2 个解决方案
#1
5
One way with awk
:
awk的一种方法:
awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file
Test:
$ cat file
Employee: John C.
2013-01-01 10 $123
2013-01-02 12 $120
2013-01-03 8 $150
Employee: Michael G.
2013-01-01 5 $13
2013-01-05 11 $20
2013-01-10 2 $155
$ awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file
2013-01-01 10 $123 John C.
2013-01-02 12 $120 John C.
2013-01-03 8 $150 John C.
2013-01-01 5 $13 Michael G.
2013-01-05 11 $20 Michael G.
2013-01-10 2 $155 Michael G.
#2
2
Code for GNU sed:
GNU sed代码:
sed '/:/{s/[^:]\+://;H;x;s/.*\n//;d};G;s/\n//' file
#1
5
One way with awk
:
awk的一种方法:
awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file
Test:
$ cat file
Employee: John C.
2013-01-01 10 $123
2013-01-02 12 $120
2013-01-03 8 $150
Employee: Michael G.
2013-01-01 5 $13
2013-01-05 11 $20
2013-01-10 2 $155
$ awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file
2013-01-01 10 $123 John C.
2013-01-02 12 $120 John C.
2013-01-03 8 $150 John C.
2013-01-01 5 $13 Michael G.
2013-01-05 11 $20 Michael G.
2013-01-10 2 $155 Michael G.
#2
2
Code for GNU sed:
GNU sed代码:
sed '/:/{s/[^:]\+://;H;x;s/.*\n//;d};G;s/\n//' file