Egzact Save

Generate flexible patterns on the shell

Project README

Generate flexible patterns on the shell

How to utilize it? See examples.

New command line tools with three concepts.

Enumeration of various patterns from the standard input.
Useful equivalents for existent Linux commands (inspired by nixar).
Controlling records and fields given by particular separator (inspired by Open-Usp-Tukubai).

Installation

egzact requires Egison version 4.1.2. Following installation procedure include installation of Egison.

Linux users (RHEL compatible distros)

$ sudo yum install https://git.io/egison-4.1.2.x86_64.rpm   ## Install Egison
$ sudo yum install https://git.io/egzact-2.1.1.rpm          ## Install egzact

Linux users (Debian base distros)

$ wget https://git.io/egison-4.1.2.x86_64.deb
$ sudo dpkg -i egison-4.1.2.x86_64.deb   ## Install Egison
$ wget https://git.io/egzact-2.1.1.deb
$ sudo dpkg -i egzact-2.1.1.deb  ## Install egzact

macOS users

$ brew install egison/egison/egison  ## Install Egison
$ brew install greymd/tools/egzact   ## Install egzact

Commands

Generate multiple results from whole the STDIN

$ `conv`

Print whole the inputs as multiple rows with given number of cols. Location of each field is shifted over to the left by comparison with one upper line. The reason why the name is conv is, the behavior looks like the convolution.

$ seq 10 | conv 2
1 2
2 3
3 4
4 5
5 6
6 7
7 8
8 9
9 10

$ yes | awk '$0=NR' | conv 3 | head
1 2 3
2 3 4
3 4 5
4 5 6
5 6 7
.
.
.

$ `flat`

Print whole the inputs as multiple rows with given number of cols. In default, it just removes the new lines.

$ seq 10 | flat
1 2 3 4 5 6 7 8 9 10

The behavior is same as xargs -n N option. However Common command line options like fs can be used.

$ seq 10 | flat 2
1 2
3 4
5 6
7 8
9 10

# Comma separeted file
$ cat myfile
AA,AB,AC,AD
BA,BB,BC,BD
CA,CB,CC,CD
DA,DB,DC,DD

# Field separator(fs) option is useful for keeping comma.
$ cat myfile | flat fs=, 8
AA,AB,AC,AD,BA,BB,BC,BD
CA,CB,CC,CD,DA,DB,DC,DD

$ `slit`

Divide whole the inputs into given number of rows.

# Print A to Z with 3 rows.
$ echo {A..Z} | slit 3
A B C D E F G H I
J K L M N O P Q R
S T U V W X Y Z

# Each line's number of field is adjusted to be near each other as much as possible.
$ echo A B C D | slit 3
A B
C
D

Generate multiple results per line.

$ `stairl`

Generate sublist of the fields. Each result matches to the left side of the original input. In most cases, it looks stairs.

$ echo A B C D | stairl
A
A B
A B C
A B C D

what's going to happen if the input has multiple lines?

$ cat myfile2
AA AB AC AD
BA BB BC BD
CA CB CC CD

# The command is executed for each line.
$ cat myfile2 | stairl
AA
AA AB
AA AB AC
AA AB AC AD
BA
BA BB
BA BB BC
BA BB BC BD
CA
CA CB
CA CB CC
CA CB CC CD

# `eos` option is helpful if you want to know where each result is coming from.
$ cat myfile2 | stairl eos=---
AA
AA AB
AA AB AC
AA AB AC AD
---
BA
BA BB
BA BB BC
BA BB BC BD
---
CA
CA CB
CA CB CC
CA CB CC CD

$ `stairr`

Generate sublist of the fields. Results match to the right side of the original input. In most cases, it looks stairs.

$ echo A B C D | stairr
D
C D
B C D
A B C D

$ `sublist`

Generate all the sublist of the fields.

$ echo A B C D | sublist
A
A B
B
A B C
B C
C
A B C D
B C D
C D
D

Whole the results is same as stairl | stairr when the duplicated lines can be merged.

$ echo A B C D | stairl | stairr | sort | uniq
A
A B
A B C
A B C D
B
B C
B C D
C
C D
D

$ echo A B C D | sublist | sort | uniq
A
A B
A B C
A B C D
B
B C
B C D
C
C D
D

$ `subset`

Generate all the subsets of the fields.

$ echo A B C D | subset
A
B
C
D
A B
A C
B C
A D
B D
C D
A B C
A B D
A C D
B C D
A B C D

$ `crops`

Crop all the patterns which matches given string (regular expression). It includes all the patterns (from shortest to longest match).

$ echo 1110100110 | crops "1.*1"
11
111
11101
1101
101
11101001
1101001
101001
1001
111010011
11010011
1010011
10011

If you want to use normal grep command for matching query, stairr fs="" | stairl fs="" can works with almost same behavior. In addition, it is faster than crops because it works with multi processing.

$ echo 1110100110 | stairr fs="" | stairl fs="" | grep -o '1.*1' | sort | uniq
1001
10011
101
101001
1010011
11
1101
1101001
11010011
111
11101
11101001
111010011

$ `cycle`

Generate all the circulated patterns.

$ echo A B C D E | cycle
A B C D E
B C D E A
C D E A B
D E A B C
E A B C D

$ `comb`

Generate combinations of N of fields.

$ echo A B C D | comb 2
A B
A C
B C
A D
B D
C D

$ `perm`

Generate permutations of N of fields.

$ echo A B C D | perm 2
A B
A C
B A
A D
B C
C A
B D
C B
D A
C D
D B
D C

$ `dupl`

Duplicate lines.

$ echo A B C D | dupl 3
A B C D
A B C D
A B C D

$ `obrev`

Show given line and reversed line.

Obverse and Reverse

$ echo A B C D | obrev
A B C D
D C B A

Generate single result for each line.

$ `addl`

Add str to left side of the input.

Add + Left

$ echo abc | addl ABC
ABCabc

$ `addr`

Add str to right side of the input.

Add + Right

$ echo abc | addr ABC
abcABC

$ `mirror`

Reverse the order of the field.

$ echo A B C D | mirror
D C B A

$ `takel`

Print first N of fields.

Take + Left

$ echo A B C D | takel 3
A B C

$ `taker`

Print last N of fields.

Take + Right

$ echo A B C D | taker 3
B C D

$ `takelx`

Print fields from first one to the one which matches given regular expression.

Take + Left + regeX

$ echo QBY JCG FCM PAG TPX BQG UGB | takelx "^P.*$"
QBY JCG FCM PAG

$ `takerx`

Print fields from last one to the one which matches given regular expression.

Take + Right + regeX

$ echo QBY JCG FCM PAG TPX BQG UGB | takerx "^P.*$"
PAG TPX BQG UGB

$ `dropl`

Remove first N of fields.

Drop + Left

$ echo QBY JCG FCM PAG TPX BQG UGB | dropl 3
PAG TPX BQG UGB

$ `dropr`

Remove last N of fields.

Drop + Right

$ echo QBY JCG FCM PAG TPX BQG UGB | dropr 3
QBY JCG FCM PAG

$ `zrep`

Extract particular fields which matches given regular expression.

egZact + gREP

$ echo 1 2 3 4 5 6 7 8 9 10 | zrep "1"
1 10

$ `zniq`

Merge duplicated fields.

egZact + uNIQ

$ echo aaa bbb ccc aaa bbb | zniq
aaa bbb ccc

$ `wrap`

Add particular prefix and suffix to each field in accordance with given argument. * is the placeholder which represents each field.

$ echo aaa bbb ccc | wrap "<p>*</p>"
<p>aaa</p> <p>bbb</p> <p>ccc</p>

$ `nestl`

Nest all the fields with with given argument. * is the placeholder which represents each field. First field is the most deeply nested element.

Nest + Left

$ echo aaa bbb ccc | nestl "<p>*</p>"
<p> <p> <p> aaa </p> bbb </p> ccc </p>

$ `nestr`

Nest all the fields with with given argument. * is the placeholder which represents each field. Last field is the most deeply nested element.

Nest + Right

$ echo aaa bbb ccc | nestr "<p>*</p>"
<p> aaa <p> bbb <p> ccc </p> </p> </p>

Other commands

$ `addt`

Add str to top of the input.

Add + Top

$ echo abc | addt ABC
ABC
abc

$ `addb`

Add str to bottom of the input.

Add + Bottom

$ echo abc | addb ABC
abc
ABC

Common command line options

`fs`

Field separator.

Default value is space .
Format: fs=STR

Example

$ echo "/usr/local/var/" | stairl fs=/

/usr
/usr/local
/usr/local/var
/usr/local/var/

# In case of empty, each character is regarded as a field.
$ echo "abcdefg" | stairl fs=""
a
ab
abc
abcd
abcde
abcdef
abcdefg

`ifs`

Input field separator. If fs is already set, this option is primarily used.

Default value is space .
Format: ifs=STR

Example

$ cat myfile3
AA,AB,AC,AD
BA,BB,BC,BD

# "," separated input -> " " separated output.
$ cat myfile3 | stairr ifs=","
AD
AC AD
AB AC AD
AA AB AC AD
BD
BC BD
BB BC BD
BA BB BC BD

`ofs`

Output field separator. If fs is already set, this option is primarily used.

Default value is space .
Format: ofs=STR

Example

$ cat myfile3
AA,AB,AC,AD
BA,BB,BC,BD

# "," separated input -> "_" separated output.
$ cat myfile3 | cycle ifs="," ofs="_"
AA_AB_AC_AD
AB_AC_AD_AA
AC_AD_AA_AB
AD_AA_AB_AC
BA_BB_BC_BD
BB_BC_BD_BA
BC_BD_BA_BB
BD_BA_BB_BC

# "," separated input -> tab separated output.
$ cat myfile3 | dupl ifs="," ofs="\t" 2
AA      AB      AC      AD
AA      AB      AC      AD
BA      BB      BC      BD
BA      BB      BC      BD

`eor`

End of record (a.k.a, row). Result of each line (record) is separated with new line \n in default. This option changes the string for separating each record.

Default value is new line \n.
Format: eor=STR

Example

$ cat myfile4
AA AB AC AD
BA BB BC BD

$ cat myfile4 | stairl
AA           # End of record
AA AB        # End of record
AA AB AC     # End of record
AA AB AC AD  # End of set
BA           # End of record
BA BB        # End of record
BA BB BC     # End of record
BA BB BC BD  # End of set

$ cat myfile4 | stairr eor=" @@@ "
AD @@@ AC AD @@@ AB AC AD @@@ AA AB AC AD
BD @@@ BC BD @@@ BB BC BD @@@ BA BB BC BD

`eos`

End of set. Set means, all results generated from single line, in this manual.

Default value is new line \n.
Format: eos=STR

Example

$ cat myfile4
AA AB AC AD
BA BB BC BD


$ cat myfile4 | stairl eos="---"
AA
AA AB
AA AB AC
AA AB AC AD
---
BA
BA BB
BA BB BC
BA BB BC BD

$ cat myfile4 | stairl eos="---" eor=" @@@ " ofs=" | "
AA @@@ AA | AB @@@ AA | AB | AC @@@ AA | AB | AC | AD
---
BA @@@ BA | BB @@@ BA | BB | BC @@@ BA | BB | BC | BD

Tips

A special command line option each is available in flat, conv and slit commands. The option changes command's behavior to "each line mode". In default, those commands handle whole the standard input (STDIN). However with this option, those commands can read each line and print the result.

Example

$ cat myfile4
AA AB AC AD
BA BB BC BD

$ cat myfile4 | flat 3
AA AB AC
AD BA BB
BC BD

$ cat myfile4 | flat each 3
AA AB AC
AD
BA BB BC
BD

$ cat myfile4 | conv each 3 eos="---"
AA AB AC
AB AC AD
---
BA BB BC
BB BC BD

Uninstall

$ make uninstall

License

Software License

This software is released under the MIT License. See LICENSE

External Library

UnitTest.hs is distributed on egison/egison under the MIT license.

Logo

The logo is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Open Source Agenda is not affiliated with "Egzact" Project. README Source: greymd/egzact

Stars

Open Issues

Last Commit

1 year ago

Repository

greymd/egzact

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/egzact"><img src="https://www.opensourceagenda.com/projects/egzact/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022

Egzact Save

Generate flexible patterns on the shell

New command line tools with three concepts.

Installation

Linux users (RHEL compatible distros)

Linux users (Debian base distros)

macOS users

Commands

Generate multiple results from whole the STDIN

$ conv

$ flat

$ slit

Generate multiple results per line.

$ stairl

what's going to happen if the input has multiple lines?

$ stairr

$ sublist

$ subset

$ crops

$ cycle

$ comb

$ perm

$ dupl

$ obrev

Generate single result for each line.

$ addl

$ addr

$ mirror

$ takel

$ taker

$ takelx

$ takerx

$ dropl

$ dropr

$ zrep

$ zniq

$ wrap

$ nestl

$ nestr

Other commands

$ addt

$ addb

Common command line options

fs

ifs

ofs

eor

eos

Tips

Uninstall

License

Software License

External Library

Logo

Open Source Agenda Badge

From the blog

How to Choose Which Programming Language to Learn First?

From the blog

How to Choose Which Programming Language to Learn First?

$ `conv`

$ `flat`

$ `slit`

$ `stairl`

$ `stairr`

$ `sublist`

$ `subset`

$ `crops`

$ `cycle`

$ `comb`

$ `perm`

$ `dupl`

$ `obrev`

$ `addl`

$ `addr`

$ `mirror`

$ `takel`

$ `taker`

$ `takelx`

$ `takerx`

$ `dropl`

$ `dropr`

$ `zrep`

$ `zniq`

$ `wrap`

$ `nestl`

$ `nestr`

$ `addt`

$ `addb`

`fs`

`ifs`

`ofs`

`eor`

`eos`