87th PHP Study Group LT: Text-Processing Commands That Might Be Worth Knowing
TRANSCRIPT
-
@serima
-
| PHP
8
PHP
-
mysqldump
scp
-
TreasureData
TreasureData
-
Excel
-
PHP
-
UNIX
-
cat, sort, uniq, wc, grep, awk, diff, cut, sed
-
join
Merges two files on a common key
Both files must be sorted on that key first
-
join
a.txt ID, ,
b.txt ID,
join -a 1 -e - -o 0,1.2,1.3,2.2
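As a rough illustration of what these options do, here is a minimal sketch. The contents of a.txt and b.txt are hypothetical (the names, ages and cities are invented for this example), and the files are written to a temporary directory.

```shell
# Hypothetical sample data; the names, ages and cities are invented.
workdir=$(mktemp -d)
printf '%s\n' '3 carol 41' '1 alice 30' '2 bob 25' > "$workdir/a.txt"   # ID, name, age
printf '%s\n' '3 osaka' '1 tokyo' > "$workdir/b.txt"                    # ID, city (no ID 2)

# join requires both inputs to be sorted on the join field.
LC_ALL=C sort -k1,1 "$workdir/a.txt" > "$workdir/a_sorted.txt"
LC_ALL=C sort -k1,1 "$workdir/b.txt" > "$workdir/b_sorted.txt"

# -a 1 : also print lines of the first file that have no partner
# -e - : print "-" for fields that are missing
# -o   : output the join key (0), fields 2 and 3 of file 1, field 2 of file 2
join_result=$(LC_ALL=C join -a 1 -e - -o 0,1.2,1.3,2.2 \
    "$workdir/a_sorted.txt" "$workdir/b_sorted.txt")
printf '%s\n' "$join_result"
```

With this sample data, ID 2 has no row in b.txt, so its last field is printed as `-`.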
-
|
-
|
cat a.txt | sort > a_sorted.txt
-
Files A and B each contain a list of IDs
A: 1 2 3
B: 1 3
Wanted: 2
(the IDs that are in A but not in B)
-
diff awk
diff
-
Union of A and B
cat A B | sort | uniq
Intersection of A and B
cat A B | sort | uniq -d
join A B
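The pipelines above can be tried on two small ID files. This is a sketch under two assumptions: each file is already sorted and has no duplicate lines of its own (otherwise `uniq -d` would over-report). The set-difference line uses `join -v 1`, which is one possible approach and not necessarily the one from the talk.

```shell
# Sample files: A holds IDs 1 2 3, B holds IDs 1 3.
workdir=$(mktemp -d)
printf '%s\n' 1 2 3 > "$workdir/A"
printf '%s\n' 1 3 > "$workdir/B"

# Union: every ID that appears in either file, once.
union=$(cat "$workdir/A" "$workdir/B" | sort | uniq)

# Intersection: IDs that appear twice after concatenation, i.e. in both files.
intersection=$(cat "$workdir/A" "$workdir/B" | sort | uniq -d)

# Difference (A minus B): lines of file 1 that join cannot pair with file 2.
difference=$(join -v 1 "$workdir/A" "$workdir/B")

printf 'union: %s\n' "$(printf '%s' "$union" | tr '\n' ' ')"
printf 'intersection: %s\n' "$(printf '%s' "$intersection" | tr '\n' ' ')"
printf 'difference: %s\n' "$difference"
```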
-
join
join
-
Running queries against TSV and CSV files
q - Text as Data
https://github.com/harelba/q
q
-
% seq 1 1000 | q "select avg(c1),sum(c1) from -"
500.5 500500
c1 is the first column; avg(c1) is the average of 1 through 1000, and sum(c1) is their sum
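For comparison, the same aggregate can be computed with awk alone when q is not installed. This is a sketch of an alternative, not part of q itself.

```shell
# Sum the first column, then print the average and the sum at end of input.
awk_result=$(seq 1 1000 | awk '{ s += $1 } END { print s / NR, s }')
printf '%s\n' "$awk_result"
```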
-
GROUP BY
% cat test.txt
Mike 10
John 1
Bob 8
Mary 19
Bob 15
Bob 20
Mike 2
% q "select c1, count(*), sum(c2) from test.txt group by c1"
Bob 3 43
John 1 1
Mary 1 19
Mike 2 12
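The same grouping can be approximated with awk associative arrays. The sketch below recreates test.txt in a temporary directory and sorts the result explicitly, because awk does not guarantee the order in which `for (k in ...)` visits keys.

```shell
# Recreate the sample file from the slide in a temporary directory.
workdir=$(mktemp -d)
printf '%s\n' 'Mike 10' 'John 1' 'Bob 8' 'Mary 19' 'Bob 15' 'Bob 20' 'Mike 2' \
    > "$workdir/test.txt"

# cnt[name] counts rows per name, sum[name] adds up the second column.
group_result=$(awk '{ cnt[$1]++; sum[$1] += $2 }
    END { for (k in cnt) print k, cnt[k], sum[k] }' "$workdir/test.txt" | LC_ALL=C sort)
printf '%s\n' "$group_result"
```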
-
JOIN
% cat test1.txt
a1234 aa1
b1234 aa2
c1234 aa3
d1234 aa3
% cat test2.txt
1 a1234 bb1
2 a1234 bb2
3 b1234 cc1
4 c1234 dd3
5 e1234 NULL
% q "select * from test1.txt as a left join test2.txt as b on a.c1 = b.c2"
a1234 aa1 1 a1234 bb1
a1234 aa1 2 a1234 bb2
b1234 aa2 3 b1234 cc1
c1234 aa3 4 c1234 dd3
d1234 aa3
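A similar left join can be sketched with the join command, using `-1` and `-2` to name the key column in each file. One difference from the q output is an assumption of this sketch: unmatched fields are filled with the literal string NULL via `-e`, so the last row does not end after `aa3`.

```shell
# Recreate the two sample files from the slide in a temporary directory.
workdir=$(mktemp -d)
printf '%s\n' 'a1234 aa1' 'b1234 aa2' 'c1234 aa3' 'd1234 aa3' > "$workdir/test1.txt"
printf '%s\n' '1 a1234 bb1' '2 a1234 bb2' '3 b1234 cc1' '4 c1234 dd3' '5 e1234 NULL' \
    > "$workdir/test2.txt"

# Each input must be sorted on its own join field: column 1 and column 2.
LC_ALL=C sort -k1,1 "$workdir/test1.txt" > "$workdir/test1_sorted.txt"
LC_ALL=C sort -k2,2 "$workdir/test2.txt" > "$workdir/test2_sorted.txt"

# -1 1 -2 2 : join column 1 of file 1 with column 2 of file 2
# -a 1      : keep unmatched rows of file 1 (left join)
# -e NULL   : placeholder for the missing file 2 fields
left_join=$(LC_ALL=C join -1 1 -2 2 -a 1 -e NULL -o 1.1,1.2,2.1,2.2,2.3 \
    "$workdir/test1_sorted.txt" "$workdir/test2_sorted.txt")
printf '%s\n' "$left_join"
```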
-
http://serima.co/blog