Uploaded by Andy Tan

Watsons Game theory

advertisement
ASM Sc. J., 13, 2020
https://doi.org/10.32802/asmscj.2020.sm26(5.1)
1-Normal Form for Static Watson-Crick Regular and
Linear Grammars
Aqilahfarhana Abdul Rahman1, Wan Heng Fong2*, Nor Haniza Sarmin3 and Sherzod Turaev4
1,2,3Department of Mathematical Sciences, Faculty of Science, Universiti Teknologi Malaysia, 81310 UTM Skudai,
Johor, Malaysia.
4Sherzod Turaev Department of Computer Science & Software Engineering, College of Information Technology,
United Arab Emirates University, Al Ain 15551, United Arab Emirates
In DNA computing, there are various formal language theoretical approaches that involves the
recombinant behavior of DNA sequences. A Watson-Crick automaton is a mathematical model that
represents the biological properties of DNA based on the Watson-Crick complementarity of DNA
molecules. Meanwhile, a sticker system is another DNA computing models which uses the sticker
operation to form complete double-stranded sequences. Previously, Watson-Crick grammars have been
introduced as the grammar counterparts of Watson-Crick automata which generate double-stranded
strings using the Watson-Crick complementarity rules. Following that, static Watson-Crick grammars
are introduced, where both stranded strings are generated dependently by checking for the WatsonCrick complementarity of each complete substring. In formal language theory, normal forms, such as
Chomsky Normal Form (CNF) are defined by imposing the restrictions on the rules contained in
context-free grammars. However, in previous research, 1-normal form for a Watson-Crick linear
grammar was defined. In this research, 1-normal forms are introduced for both static Watson-Crick
regular and linear grammars. Moreover, the implementation of 1-normal form is also presented by
investigating the computational properties between the static Watson-Crick regular and linear
grammars. The results from this research, hence, simplify the length of the rules in the grammars,
which are useful for studying computational properties of Watson-Crick grammars.
Keywords: DNA computing, formal languages, context-free grammars, static Watson-Crick
grammars, normal form
I.
INTRODUCTION
nitrogenous base. The four nitrogenous bases are adenine
(A), thymine (T), guanine (G), and cytosine (C). The
DNA (Deoxyribonucleic acid) computing contains various
formal language theoretical approaches that broadly use
the recombinant behavior of DNA sequences under the
effect of enzymatic activities. DNA is a polymer which is
constructed
from
monomers
namely
deoxyribonucleotides.
Each
deoxyribonucleotides
adenine (A) and guanine (G) bases are double-ring
molecules called purines; whereas the cytosine (C) and
thymine (T) bases are single-ring molecules called
pyrimidine (P ãun et al., 1998).
A DNA molecule is composed of two DNA strands
which are held together by the hydrogen bonds between
consists
of
three
components; a sugar, a phosphate group, and a
_________
*Corresponding author’s e-mail: fwh@utm.my
the paired bases. There are two fundamental features of
DNA
molecules
known
as
Watson-Crick
(WK)
ASM Science Journal, Volume 13, 2020
complementarity and massive parallelism of DNA strands.
forms of context-free grammars are the Chomsky Normal
WK complementarity is a base pairing where a purine
Form and Greibach Normal Form (Levelt, 1974). For WK
always binds with a pyrimidine, but each purine binds to
grammars, Zulkufli et al. (2016) have introduced 1-normal
one particular type of pyrimidine only; meanwhile,
form for WK linear grammars and showed that for every
massive parallelism of DNA strands allows construction of
WK linear grammar, there exist an equivalent WK linear
many copies of DNA strands where numerous operations
grammar in 1-normal form. In this research, we
are
investigate the 1-normal form for each static WK regular
carried
out
on
the
encoded
information
and linear grammars.
simultaneously (P ãun et al., 1998; Ng et al., 2007).
DNA computation has been marked by Adleman
This paper is organized as follows: Section 1
(1994). By using the DNA strand in his experiment, he
introduces the background of the research. In Section 2,
was able to solve the Hamiltonian path problem (HPP) for
some preliminary concepts involve the basic terms of
a simple graph with the method of sticker operation.
strings, languages and grammars, sticker systems as well
Then, Kari et al. (1998) proposed a mathematical model
as dynamic and static WK grammars are presented. In
known as a sticker system which uses the sticker
Section 3, the normal forms for static WK regular and
operation on DNA to form complete double-stranded
linear grammars are introduced.
sequences. Following that, Freund et al. (1997) proposed
the Watson-Crick automata (WKA) which is one of the
II.
PRELIMINARIES
mathematical models used in DNA computation. WKA is
an extension of finite automata with the addition of two
This section includes some preliminary concepts which
reading heads on double-stranded sequences.
involves the basic terms and definitions that are used in
The grammatical studies of DNA strands started in
this paper. The reader may refer to (P ãun et al., 1998;
2012 when Subramanian et al. (2012) introduced the WK
L i n z , 2 0 0 6 ) for detailed information regarding on the
regular grammar, and later, modified variants for
basic concepts of strings, languages, grammars and sticker
different types of grammars were defined in (Zulkufli et
systems.
al., 2016). The research is motivated by the synthesis
In this paper, the symbol ∈ denotes the membership of
processes in DNA replication which can be simulated by
an element to a set. Let 𝑇 be an alphabet which is a
derivations in the WK grammars. Although these WK
nonempty finite set of abstract symbols, then 𝑇 ∗ is a set of
grammars use different restriction of production rules, all
all strings, a finite sequence of symbols (words) over 𝑇. A
of them generate double-stranded strings dynamically:
string with no symbol is called the empty string and
the WK complementarity can only be checked once
denoted by πœ†. The set 𝑇 + is defined as the set of all
generating both strands of a complete double-stranded
nonempty finite strings over 𝑇, i.e., 𝑇 + = 𝑇 ∗ − πœ†.
string. Motivated by the WK grammars, the static WK
A Chomsky grammar is defined as a quadruple 𝐺 =
grammars are proposed as a grammar counterpart of the
(𝑁, 𝑇, 𝜌, 𝑆, 𝑃) where the alphabet 𝑁 is defined as the
sticker systems (Abdul Rahman et al., 2018a; 2018b). This
nonterminal alphabet, 𝑇 is the terminal alphabet, 𝑆 ∈ 𝑁
new theoretical model generates both stranded strings
is the axiom or the start symbol, and 𝑃 ⊆ (𝑁 ∪ 𝑇)∗ 𝑁(𝑁 ∪
dependently by checking for WK complementarity of each
𝑇)∗ is a set of production rules of 𝐺. The rules (π‘₯, 𝑦) ∈ 𝑃
complete substring and also illustrate the replication of
are written in the form of π‘₯ → 𝑦. We say that 𝑒 directly
DNA in DNA molecules.
derives 𝑣 or 𝑣 is derived from 𝑒 with respect to 𝐺, which is
Next, a normal form is defined by imposing the
written as 𝑒 ⇒ 𝑣, if and only if 𝑒 = 𝑒1 π‘₯𝑒2 , 𝑣 = 𝑒1 𝑦𝑒2 , for
restrictions of the rules contained in the grammar. In
some 𝑒1 , 𝑒2 ∈ (𝑁 ∪ 𝑇)∗ and π‘₯ → 𝑦 ∈ 𝑃. The reflexive and
Chomsky grammars, the normal forms are implemented
for
context-free
grammars
and
transitive closure of ⇒ is denoted by ⇒∗ .
context-sensitive
A grammar can normally generate many strings by
grammars (Ito et al., 2010). The most important normal
applying the rules in arbitrary order. The set of all
2
ASM Science Journal, Volume 13, 2020
terminal strings is the language generated by the grammar
𝐿𝜌 (𝑉) = ((
which is defined by 𝐿(𝐺) = {𝑀 ∈ 𝑇 ∗ : 𝑆 ⇒∗ 𝑀}.
𝑉 ∗
πœ†
𝑉∗
π‘…πœŒ (𝑉) = [ ] (( ∗ ) ∪ ( )),
π‘‰πœŒ 𝑉
πœ†
The Chomsky grammars are classified depending on
their respective form of production rules. A grammar 𝐺 =
(𝑁, 𝑇, 𝜌, 𝑆, 𝑃) is called context-sensitive, if each rule 𝑒 →
πΏπ‘…πœŒ (𝑉) = ((
𝑣 ∈ 𝑃 has 𝑒 = 𝑒1 𝐴𝑒2 , 𝑣 = 𝑒1 π‘₯𝑒2 for 𝑒1 , 𝑒2 ∈ (𝑁 ∪ 𝑇)∗ , 𝐴 ∈
according to our grammar, where
has 𝑒 ∈ 𝑁; linear, if each rule 𝑒 → 𝑣 ∈ 𝑃 has 𝑒 ∈ 𝑁 and 𝑣 ∈
πΏπ‘…πœŒ∗ (𝑇) = ((
𝑇 ∗ ∪ 𝑇 ∗ 𝑁𝑇 ∗ ; regular, if each rule 𝑒 → 𝑣 ∈ 𝑃 has 𝑒 ∈ 𝑁
and 𝑣 ∈ 𝑇 ∪ 𝑇𝑁 ∪ {πœ†} (P aΜƒun et al., 1998).
πΏπ‘…πœŒ+ (𝑇) = ((
All those families of languages generated by contextsensitive, context-free, linear and regular grammars are
𝑇∗
πœ†
𝑇∗
πœ†
𝑇∗
∗ ) ∪ ( )) [𝑇 ] (( ∗ ) ∪ ( )),
𝑇
πœ†
𝑇
πœ†
𝜌
𝑇+
πœ†
𝑇∗
πœ†
𝑇∗
) ∪ ( )) [ ] (( ∗ ) ∪ ( )),
π‘‡πœŒ 𝑇
𝑇∗
πœ†
πœ†
and the alphabet 𝑉 which is defined in π‘ŠπœŒ (𝑉) is changed to
denoted as CS, CF, LIN and REG respectively. Other
alphabet 𝑇 according to the definition in the Chomsky
than that, RE and FIN represent the family of recursive
grammar.
enumerable languages, i.e., arbitrary languages and finite
Next, we recall the definition of static WK regular
languages.
grammar and static WK linear grammar. Since static WK
Further, we recall the definitions of Watson-Crick
regular grammar consists of right-linear and left-linear
grammars:
grammar, then we state only for right-linear grammar
(Rahman et al., 2018a) in this paper.
Definition 1. (Zulkufli et al., 2016)
A Watson-Crick (WK) grammar 𝐺 = (𝑁, 𝑇, 𝜌, 𝑆, 𝑃) is called
Definition 2. (Rahman et al., 2018a)
regular, if each production has the form 𝐴 → ⟨𝑒/𝑣⟩ where
A static Watson-Crick right-linear grammar is a 5-tuple
𝐴, 𝐡 ∈ 𝑁 and ⟨𝑒/𝑣⟩ ∈ ⟨𝑇 ∗ /𝑇 ∗ ⟩; linear, if each production
𝐺 = (𝑁, 𝑇, 𝜌, 𝑆, 𝑃) where 𝑁, 𝑇 are disjoint nonterminal and
has the form 𝐴 → ⟨𝑒1 /𝑣1 ⟩𝐡⟨𝑒2 /𝑣2 ⟩ or 𝐴 → ⟨𝑒/𝑣⟩ where
and ⟨𝑒/𝑣⟩, ⟨𝑒1 /𝑣1 ⟩, ⟨𝑒2 /𝑣2 ⟩ ∈ ⟨𝑇 ∗ /𝑇 ∗ ⟩;
terminal alphabets respectively, 𝜌 ∈ 𝑇 × π‘‡ is a symmetric
context-
relation (Watson-Crick complementarity), 𝑆 ∈ 𝑁 is a start
free, if each production has the form 𝐴 → 𝛼 where 𝐴 ∈ 𝑁
symbol (axiom) and 𝑃 is a finite set of production rules in
and 𝛼 ∈ (𝑁 ∪ ⟨𝑇 ∗ /𝑇 ∗ ⟩)∗ .
the form of
𝑒 π‘₯
𝑒 π‘₯
(i) 𝑆 → [ ] (𝑦) 𝐴 where 𝐴 ∈ 𝑁 − {𝑆}, [ ] (𝑦) ∈ π‘…πœŒ (𝑇);
𝑣
𝑣
π‘₯
π‘₯
(ii) 𝐴 → (𝑦) 𝐡 where 𝐴, 𝐡 ∈ 𝑁 − {𝑆}, (𝑦) ∈ πΏπ‘…πœŒ ∗ (𝑇); or
In order to generate or form a complete doublestranded sequence of DNA, the sticker system uses a
sticker
𝑉 +
πœ†
𝑉∗
πœ†
𝑉∗
∗ ) ∪ ( )) [𝑉 ] (( ∗ ) ∪ ( )).
𝑉
𝑉
πœ†
πœ†
𝜌
In this research, the definition of πΏπ‘…πœŒ (𝑉) is modified
𝑁 and π‘₯ ∈ (𝑁 ∪ 𝑇)+ ; context-free, if each rule 𝑒 → 𝑣 ∈ 𝑃
𝐴, 𝐡 ∈ 𝑁
𝑉 ∗
πœ†
𝑉∗
∗ ) ∪ ( )) [𝑉 ] ,
𝑉
πœ†
𝜌
operation
on
DNA
molecules
(P ãun
and
π‘₯ 𝑒
π‘₯ 𝑒
(iii) 𝐴 → (𝑦) [ ] where 𝐴 ∈ 𝑁 − {𝑆}, (𝑦) [ ] ∈ 𝐿𝜌 (𝑇).
𝑣
𝑣
Rozenberg, 1998). Let 𝑉 be an alphabet for a symmetric
𝑉 ∗
relation 𝜌 ∈ 𝑉 × π‘‰ over 𝑉. The set π‘ŠπΎπœŒ (𝑉) = [ ] where
π‘‰πœŒ
Definition 3. (Rahman et al., 2018b)
π‘Ž
π‘Ž
𝑉
[ ] = {[ ] | π‘Ž, 𝑏 ∈ 𝑉, ( ) ∈ 𝜌} denotes the Watson-Crick
𝑏
𝑏
π‘‰πœŒ
A static Watson-Crick linear grammar is a 5-tuple 𝐺 =
(𝑁, 𝑇, 𝜌, 𝑆, 𝑃) where 𝑁, 𝑇 are disjoint nonterminal and
domain associated to alphabet 𝑉 and the complementarity
terminal alphabets respectively, 𝜌 ∈ 𝑇 × π‘‡ is a symmetric
𝑀1
relation 𝜌. The elements [𝑀 ] ∈ π‘ŠπΎπœŒ (𝑉) are called well2
relation (Watson-Crick complementarity), 𝑆 ∈ 𝑁 is a start
formed double-stranded sequences. The strings 𝑀1 is the
symbol (axiom) and 𝑃 is a finite set of production rules in
upper strand and 𝑀2 is the lower strand of the molecule.
the form of
π‘₯2 𝑒2
𝑒1 π‘₯1
(i) 𝑆 → [𝑣 ] (𝑦 ) 𝐴 (𝑦 ) [𝑣 ]
Apart from that, the set of incomplete molecules are
1
denoted as π‘ŠπœŒ (𝑉) = 𝐿𝜌 (𝑉) ∪ 𝑅(𝑉) ∪ πΏπ‘…πœŒ (𝑉), where
1
2
2
where
π‘₯2 𝑒2
𝑒1 π‘₯1
[𝑣 ] (𝑦 ) ∈ π‘…πœŒ (𝑇), (𝑦 ) [𝑣 ] ∈ 𝐿𝜌 (𝑇);
1
2
1
2
3
𝐴 ∈ 𝑁 − {𝑆},
ASM Science Journal, Volume 13, 2020
π‘₯1
π‘₯2
π‘₯1
π‘₯2
(ii) 𝐴 → (𝑦 ) 𝐡 (𝑦 ) where 𝐴, 𝐡 ∈ 𝑁 − {𝑆}, (𝑦 ) , (𝑦 ) ∈
1
2
1
2
π‘₯ 𝑒
π‘₯ 𝑒
(iii) 𝑃3 = {𝐴 → (𝑦) [ ] ∈ 𝑃 | (𝑦) [ ] ∈ 𝐿𝜌 (𝑇), 𝐴 ∈ 𝑁 −
𝑣
𝑣
πΏπ‘…πœŒ ∗ (𝑇); or
{𝑆}};
π‘₯1
π‘₯1
(iii) 𝐴 → (𝑦 ) where 𝐴 ∈ 𝑁 − {𝑆}, (𝑦 ) ∈ πΏπ‘…πœŒ ∗ (𝑇).
1
1
i.e., 𝑃 = 𝑃1 ∪ 𝑃2 ∪ 𝑃3 . Without loss of generality, consider
Next, the 1-normal form for static WK regular and
π‘₯
π‘₯
π‘Ž π‘Ž β‹― π‘Žπ‘˜
)
π‘Ÿ: 𝐴 → (𝑦) 𝐡 where (𝑦) = ( 1 2
πœ†
linear grammars are discussed in the following section.
𝑏 𝑏 β‹― 𝑏𝑛
πœ†
[ 1 2
](
). We construct the sequence of
𝑐1 𝑐2 β‹― 𝑐𝑛 𝑑1 𝑑2 β‹― π‘‘π‘š
III.
RESULTS AND DISCUSSIONS
new
The normal form represents the standardized form for the
production rules in the grammars. Therefore in this
section, we define a 1-normal form for static WK regular
grammar
as
stated
π‘Žπ‘˜
𝑏
𝑏
) 𝐡 π‘˜ , π΅π‘Ÿ π‘˜ → [ 1 ] πΆπ‘Ÿ 1 , β‹― , πΆπ‘Ÿ 𝑛−1 → [ 𝑛 ] πΆπ‘Ÿ 𝑛 , πΆπ‘Ÿ 𝑛 →
𝑐1
𝑐𝑛
πœ† π‘Ÿ
(
𝑑
𝑑1
) π·π‘Ÿ 1 , β‹― , π·π‘Ÿ π‘š−1 → ( π‘š ) 𝐡 where π΅π‘Ÿ 𝑖 , πΆπ‘Ÿ 𝑗 and π·π‘Ÿ π‘˜ are
πœ†
πœ†
(𝑁′, 𝑇, 𝜌, 𝑆, 𝑃′) where 𝑁′ contains the nonterminals of 𝑁
1, |𝑣𝑖 | ≤ 1. The following s h o w s a 1 - n o r m a l f o r m f or
regular
(
Next, we define a static WK regular grammar 𝐺′ =
lower strand is less than or equal to one, such that |𝑒𝑖 | ≤
WK
π‘Ž
{π‘Ÿ}: 𝐴 → ( 1 ) π΅π‘Ÿ 1 , β‹― , π΅π‘Ÿ π‘˜−1 →
πœ†
new nonterminals that are only used in the rule π‘Ÿ.
and linear grammars where the order of each upper and
st a t i c
productions
and all new nonterminals introduced above and 𝑃′
in
contains the productions constructed above. Hence, every
Definition 4 and Lemma 1.
production π‘Ÿ in 𝑃 can be replaced with the corresponding
Definition 4.
sequence {π‘Ÿ} of productions in 𝑃′ and vice versa.
A static WK regular grammar 𝐺 =
Therefore, 𝐿(𝐺) = 𝐿(𝐺 ′ ).
(𝑁, 𝑇, 𝜌, 𝑆, 𝑃) is said to be in 1-normal form if each
Next, the following definition and lemma of 1-
production in 𝑃 has one of the following forms:
normal form for static WK linear grammars are given.
π‘Ž
π‘Ž
(i) 𝐴 → ( ) 𝐡 | ( ),
πœ†
πœ†
πœ†
πœ†
(ii) 𝐴 → ( ) 𝐡 | ( ),
π‘Ž
π‘Ž
Definition 5.
A static WK linear grammar 𝐺 =
(𝑁, 𝑇, 𝜌, 𝑆, 𝑃) is said to be in 1-normal form if each
πœ†
πœ†
(iii) 𝐴 → ( ) 𝐡 | ( ),
πœ†
πœ†
π‘Ž
π‘Ž
(iv) 𝐴 → [ ] 𝐡 | [ ],
𝑏
𝑏
production in 𝑃 has one of the following forms:
π‘Ž
π‘Ž
π‘Ž
(i) 𝐴 → ( ) 𝐡 | 𝐡 ( ) | ( ),
πœ†
πœ†
πœ†
where 𝐴, 𝐡 ∈ 𝑁 and (π‘Ž, 𝑏) ∈ 𝜌.
πœ†
πœ†
πœ†
(ii) 𝐴 → ( ) 𝐡 | 𝐡 ( ) | ( ),
π‘Ž
π‘Ž
π‘Ž
πœ†
πœ†
πœ†
(iii) 𝐴 → ( ) 𝐡 | 𝐡 ( ) | ( ),
πœ†
πœ†
πœ†
π‘Ž
π‘Ž
π‘Ž
(iv) 𝐴 → [ ] 𝐡 | 𝐡 [ ] | [ ],
𝑏
𝑏
𝑏
Lemma 1. For every static WK regular grammar, there
exists a static WK regular grammar in 1-normal form.
Proof.
β– 
Let 𝐺 = (𝑁, 𝑇, 𝜌, 𝑆, 𝑃) be a static WK regular
where 𝐴, 𝐡 ∈ 𝑁 and (π‘Ž, 𝑏) ∈ 𝜌.
grammar. By definition, 𝑃 can be divided into three
subsets:
Lemma 2. For every static WK linear grammar, there
𝑒 π‘₯
𝑒 π‘₯
(i) 𝑃1 = {𝑆 → [ ] (𝑦) 𝐴 ∈ 𝑃 | [ ] (𝑦) ∈ π‘…πœŒ (𝑇), 𝐴 ∈
𝑣
𝑣
exists a static WK linear grammar in 1-normal form.
Proof.
𝑁 − {𝑆}};
Let 𝐺 = (𝑁, 𝑇, 𝜌, 𝑆, 𝑃) be a static WK linear
grammar. By definition, 𝑃 can be divided into three
π‘₯
π‘₯
(ii) 𝑃2 = {𝐴 → (𝑦) 𝐡 ∈ 𝑃 | (𝑦) ∈ πΏπ‘…πœŒ ∗ (𝑇), 𝐴, 𝐡 ∈ 𝑁 −
subsets:
π‘₯2 𝑒2
𝑒1 π‘₯1
𝑒1 π‘₯1
(i) 𝑃1 = {𝑆 → [𝑣 ] (𝑦 ) 𝐴 (𝑦 ) [𝑣 ] ∈ 𝑃 | [𝑣 ] (𝑦 ) ∈
1
2
1
1
2
1
{𝑆}}; or
π‘₯2 𝑒2
π‘…πœŒ (𝑇), (𝑦 ) [𝑣 ] ∈ 𝐿𝜌 (𝑇) and 𝐴 ∈ 𝑁 − {𝑆}};
2
4
2
ASM Science Journal, Volume 13, 2020
grammar, where 𝑃 consists of the following rules:
π‘₯1
π‘₯2
π‘₯1
π‘₯2
(ii) 𝑃2 = {𝐴 → (𝑦 ) 𝐡 (𝑦 ) ∈ 𝑃 | (𝑦 ) , (𝑦 ) ∈
1
2
1
2
π‘Ž
𝑑
𝑆 → [ ]𝐴[ ]
π‘Ž
𝑑
∗
πΏπ‘…πœŒ (𝑇), 𝐴, 𝐡 ∈ 𝑁 − {𝑆}}; or
𝑃 = 𝑃1 ∪ 𝑃2 ∪ 𝑃3 .
𝐷→
Without
loss
of
𝑛+1
π‘Ž
π‘Ž π‘Žπ‘›
𝑑
𝑑𝑛 𝑑
𝑆 ⇒ [ ] 𝐴 [ ] ⇒∗ [ ] ( ) 𝐡 ( ) [ ] ⇒∗ [π‘Žπ‘›+1 ]
π‘Ž
π‘Ž πœ†
𝑑
πœ† 𝑑
π‘Ž
generality,
𝑛+1
𝑛+1 𝑏 π‘š
𝑛+1
π‘π‘š
𝐢 [𝑑𝑛+1 ] ⇒∗ [π‘Žπ‘›+1 ] ( ) 𝐢 ( ) [𝑑𝑛+1 ] ⇒∗
πœ†
πœ† 𝑑
𝑑
π‘Ž
1
1
2
2
𝑛+1 π‘š
π‘š 𝑛+1
𝑛+1 π‘š π‘š 𝑛+1
[π‘Žπ‘›+1 𝑏 π‘š ] 𝐸 [𝑐 π‘š 𝑑𝑛+1 ] ⇒ [π‘Žπ‘›+1 𝑏 π‘š 𝑐 π‘š 𝑑𝑛+1 ].
π‘Ž 𝑏
𝑐 𝑑
π‘Ž 𝑏 𝑐 𝑑
π‘Ž1 π‘Ž2 β‹― π‘Žπ‘› 𝑏 𝑏 β‹― π‘π‘˜
𝑐 𝑐 β‹― 𝑐𝑙 𝑑1 𝑑2 β‹― π‘‘π‘š
)𝐴( 1 2
)[
[π‘Ž π‘Ž β‹― π‘Ž ] ( 1 2
].
πœ†
𝑑1 𝑑2 β‹― π‘‘π‘š
1 2
𝑛
πœ†
Therefore, 𝐿(𝐺) = {π‘Žπ‘› 𝑏 π‘š 𝑐 π‘š 𝑑𝑛 | 𝑛, π‘š ≥ 1}.
We construct the sequence of new productions
Next, we need to show that 𝐿(𝐺) = {π‘Žπ‘› 𝑏 π‘š 𝑐 π‘š 𝑑𝑛 | 𝑛, π‘š ≥
π‘Ž
π‘Ž
{π‘Ÿ}: 𝑆 → [π‘Ž1 ] π΅π‘Ÿ 1 , π΅π‘Ÿ 1 → [π‘Ž2 ] π΅π‘Ÿ 2 , β‹―,
1
2
1} ∉ SREG. By contradiction, suppose that 𝐿(𝐺) can be
π‘Žπ‘›
𝑏
→ [π‘Ž ] π΅π‘Ÿ 𝑛 , π΅π‘Ÿ 𝑛 → ( 1 ) πΆπ‘Ÿ 1 , πΆπ‘Ÿ 1 →
𝑛
πœ†
generated
by
a
static
WK
regular
grammar 𝐺′ =
(𝑁′, {π‘Ž, 𝑏, 𝑐, 𝑑}, 𝜌, 𝑆, 𝑃′). Without loss of generality, assume
𝑑
𝑏
𝑏
( 2 ) πΆπ‘Ÿ 2 , β‹― , πΆπ‘Ÿ π‘˜−1 → ( π‘˜ ) πΆπ‘Ÿ π‘˜ , πΆπ‘Ÿ π‘˜ → π·π‘Ÿ 1 [ π‘š ] , β‹― ,
π‘‘π‘š
πœ†
πœ†
π·π‘Ÿ π‘š−1 → π·π‘Ÿ π‘š [
𝐡→
From this, we obtain the derivation:
π‘₯2 𝑒2
𝑒1 π‘₯1
consider π‘Ÿ: 𝑆 → [𝑣 ] (𝑦 ) 𝐴 (𝑦 ) [𝑣 ] =
π΅π‘Ÿ
,
πœ†
πœ†
πœ†
πœ†
πœ†
( )𝐷( ) | ( )𝐸( ) , 𝐸 → ( ) .
𝑏
𝑐
𝑏
𝑐
πœ†
{𝑆}};
𝑛−1
π‘Ž
π‘Ž
𝑑
𝑑
𝐴 → ( )𝐴( )|( )𝐡( )
πœ†
πœ†
πœ†
πœ†
𝑐
𝑐
πœ†
πœ†
πœ†
πœ†
𝑏
𝑏
( )𝐡( )|( )𝐢 ( ) , 𝐢 → ( )𝐢 ( )|( )𝐷( ) ,
πœ†
πœ†
π‘Ž
𝑑
π‘Ž
𝑑
πœ†
πœ†
π‘₯1
π‘₯1
(iii) 𝑃3 = {𝐴 → (𝑦 ) ∈ 𝑃 | (𝑦 ) ∈ πΏπ‘…πœŒ ∗ (𝑇), 𝐴 ∈ 𝑁 −
1
1
i.e.,
,
that 𝐺′ is in 1-normal form. Let 𝑀 = π‘Ž 𝑗 𝑏 π‘˜ 𝑐 π‘˜ 𝑑 𝑗 be a string
𝑐
𝑐
𝑑1
] , π·π‘Ÿ π‘š → π΄π‘Ÿ 1 ( 𝑙 ) , β‹― , π΄π‘Ÿ 𝑙−1 → 𝐴 ( 1 )
πœ†
πœ†
𝑑1
in 𝐿(𝐺). Then, the grammar 𝐺′ can generate the complete
where π΄π‘Ÿ 𝑖 , π΅π‘Ÿ 𝑗 , πΆπ‘Ÿ 𝑝 and π·π‘Ÿ π‘ž are new nonterminals that are
𝑗 π‘˜ π‘˜ 𝑗
sequences [π‘Ž 𝑗 𝑏 π‘˜ 𝑐 π‘˜ 𝑑 𝑗 ] as follows:
π‘Ž 𝑏 𝑐 𝑑
used in this production with 1 ≤ 𝑖 ≤ 𝑙 − 1, 1 ≤ 𝑗 ≤ π‘˜, 1 ≤
Case 1: In the derivation of the string, the first 𝑏 can
𝑝 ≤ π‘˜ and 1 ≤ π‘ž ≤ π‘š.
occur in the upper (or lower) strand if π‘Ž 𝑗 has already been
Next, we define a static WK linear grammar 𝐺′ =
generated in the upper (or lower) strand. Then, some of
(𝑁′, 𝑇, 𝜌, 𝑆, 𝑃′) where 𝑁′ contains the nonterminals of 𝑁
the possible derivations are:
and all new nonterminals introduced above and 𝑃′
contains the productions constituted above. Hence, every
production π‘Ÿ in 𝑃 can be replaced with the corresponding
sequence {π‘Ÿ} of productions in 𝑃′ and vice versa.
′
Therefore, 𝐿(𝐺) = 𝐿(𝐺 ).
𝑖
π‘Žπ‘ 𝑏
𝑆 ⇒∗ [π‘Žπ‘– ] ( ) ( ); or
πœ† πœ†
π‘Ž
(5)
𝑗
𝑗
𝑏
𝑆 ⇒∗ [π‘Ž 𝑗 ] ( ) or 𝑆 ⇒∗ [π‘Ž 𝑗 𝑏] ,
πœ†
π‘Ž
π‘Ž 𝑏
(6)
where 𝑖 + 𝑝 = 𝑗. For derivation (5), we can
β– 
continue the
derivation by using the lower strand of π‘Ž and upper
strand of 𝑏 to generate the upper strand of 𝑏 π‘˜ :
To further investigate on the computational properties
𝑖
𝑗
π‘˜
π‘Žπ‘ 𝑏
𝑆 ⇒∗ [π‘Žπ‘– ] ( ) ( ) ⇒∗ [π‘Ž 𝑗 ] (𝑏 ).
πœ† πœ†
πœ†
π‘Ž
π‘Ž
between the static WK regular and linear grammars, the
(7)
next lemma shows that there exist a static WK linear
Following that, derivation (7) is continued by generating
language which cannot be generated by static WK regular
the first 𝑐 in the upper strand and use the lower strand of
grammar.
𝑏 to control their number of occurrences:
We
consider
a
language
𝐿(𝐺) =
𝑗
π‘˜
𝑗 π‘˜
π‘˜
𝑆 ⇒∗ [π‘Ž 𝑗 ] (𝑏 ) ⇒∗ [π‘Ž 𝑗 𝑏 π‘˜ ] (𝑐 ).
πœ†
πœ†
π‘Ž
π‘Ž 𝑏
{π‘Ž 𝑛 𝑏 π‘š 𝑐 π‘š 𝑑𝑛 | 𝑛, π‘š ≥ 1} where the idea of 1-normal form is
used to simplify the length of the rules in the grammars as
Next, derivation (8) is continued by generating the first 𝑑
shown in Lemma 3.
Lemma 3.
in the upper strand and use the strand of 𝑐 to control their
number of occurrences:
𝐿(𝐺) = {π‘Žπ‘› 𝑏 π‘š 𝑐 π‘š 𝑑𝑛 | 𝑛, π‘š ≥ 1} ∈ SLIN−
𝑗 π‘˜
π‘˜
𝑗 π‘˜ π‘˜
𝑙
𝑆 ⇒∗ [π‘Ž 𝑗 𝑏 π‘˜ ] (𝑐 ) ⇒∗ [π‘Ž 𝑗 𝑏 π‘˜ 𝑐 π‘˜ ] (𝑑 ).
πœ†
πœ†
π‘Ž 𝑏
π‘Ž 𝑏 𝑐
SREG.
Proof.
(8)
Let
(9)
The derivation can be completed by generating the lower
𝐺 = ({𝑆, 𝐴, 𝐡, 𝐢, 𝐷, 𝐸}, {π‘Ž, 𝑏, 𝑐, 𝑑},
{(π‘Ž, π‘Ž), (𝑏, 𝑏), (𝑐, 𝑐), (𝑑, 𝑑)}, 𝑆, 𝑃) be a static WK linear
5
ASM Science Journal, Volume 13, 2020
𝑗 π‘˜ π‘˜ 𝑗
strand of 𝑑. Thus, [π‘Ž 𝑗 𝑏 π‘˜ 𝑐 π‘˜ 𝑑 𝑗 ] where 𝑙 ≠ 𝑗. For derivation
π‘Ž 𝑏 𝑐 𝑑
(6), we can continue the derivation by using the same idea
as in (5) and get the possible derivation as in (7) until (9).
Case 2: The number of π‘Ž in the lower strand is controlled
by the upper strand of 𝑐. In this case, the number of 𝑐′s
cannot
be
related
to
the
number
of
𝑏
such
𝑗
π‘˜ 𝑗
that 𝑆 ⇒∗ [π‘Ž 𝑗 ] (𝑏 𝑐 ). Thus, we can conclude that 𝐿(𝐺) ∉
π‘Ž
πœ†
SREG since the number of 𝑏, 𝑐 and 𝑑 are difficult to
control at the same time using SREG rules.
IV.
CONCLUSION
In this paper, the 1-normal form for static WK regular
and linear grammars are defined. We show that for each
grammar, there exists the equivalent grammars by using
the concept of 1-normal form. In addition, the
implementation of 1-normal form has been shown in
Lemma 3 to investigate the computational properties
between the static WK regular and linear grammars. It
has been found that there exist a static WK linear
language which cannot be generated by static WK
regular grammar. There are some interesting topic that
can be explored in the future research such as to study
the computational
properties
for
the
static
WK
grammars, define static WK context-free grammar and
introduce the normal forms for static WK context-free
grammar.
V.
ACKNOWLEDGEMENT
The first author would like to thank UTM Zamalah for
funding her studies at Universiti Teknologi Malaysia
(UTM) under the Zamalah Scholarship. The second and
third authors would also like to thank the Ministry of
Education (MOE) and Research Management Centre
(RMC), UTM
for
the financial funding
through
Fundamental Research Grant Scheme (FRGS) Vote No.
5F022.
6
ASM Science Journal, Volume 13, 2020
VI. REFERENCES
Linz, P. (2006). An introduction to formal languages and
Adleman, L. M. (1994). Molecular computation of solutions
automata, Jones and Bartlett Publishers, USA.
to combinatorial problems. Science, 266 (5187), 10211024.
P ãun, G. & Rozenberg, G. (1998). Sticker systems.
Theoretical Computer Science, 204, 183-203.
Freund, R., P ãun, G., Rozenberg, G. & Salomaa, A. (1997).
Watson-Crick finite automata. Proceeding 3rd DIMACS
P ãun, G., Rozenberg, G. & Salomaa, A. (1998). DNA
Workshop on DNA based Computers, Philadelphia, 297-
computing: new computing paradigms. Springer-Verlag
328.
& Business Media.
Hg, W. D. & Wong, C. K. B. (2007). Self-Recognition of DNA
Rahman, A. A., Fong, W. H., Sarmin, N. H., Turaev, S. &
from life processes to DNA computation. Biophysical
Zulkufli, N. L. M. (2018a). Static Watson-Crick regular
reviews and letters, 2(02), 123-137.
grammar,
and
grammars and its computational power, International
Kari, L., P ãun, G., Rozenberg, G., Salomaa, A. & Yu, S.
sticker
Applied
Zulkufli, N. L. M. (2018b). Static Watson-Crick linear
2008, Kyoto, Japan, World Scientific.
computing,
of
Rahman, A. A., Fong, W. H., Sarmin, N. H., Turaev, S. &
languages and algebraic systems: Proceedings of AFLAS
DNA
Journal
Fundamental Sciences, 14, 457-462.
Ito, M., Kobayashi, Y. & Shoji, K. (2010). Automata, Formal
(1998).
Malaysian
systems,
Graduate Conference on Engineering, Science and
and
Humanities (IGCESH) (accepted).
universality. Acta Informatica, 35(5), 401-420.
Levelt, W. J. (1974). Formal grammars in linguistics and
Subramanian, K. G., Hemalatha, S. & Venkat, I. (2012). On
psycholinguistics. Vol. III, Psycholinguistic Applications.
Watson-Crick automata. In Proceedings of the Second
Mouton.
International Conference on Computational Science,
Engineering and Information Technology, 151-156.
Zulkufli, N. L. M., Turaev, S., Mohd Tamrin, M. & Messikh,
A. (2016). Generative power and closure properties of
7
Watson-Crick
grammars.
Intelligence
and
Soft
Applied
Computational
Computing,
1-12
Related documents
Download