doc/openscop.texi

   1 \input texinfo
   2 @c %
   3 @c %  /**-----------------------------------------------------------------**
   4 @c %   **                           OpenScop Library                      **
   5 @c %   **-----------------------------------------------------------------**
   6 @c %   **                            openscop.texi                        **
   7 @c %   **-----------------------------------------------------------------**
   8 @c %   **                 First version: september 10th 2006              **
   9 @c %   **-----------------------------------------------------------------**/
  10 @c %
  11 @c % release 0.0: May 4th 2008
  12 @c %
  13
  14 @c % /*************************************************************************
  15 @c %  *                              PART I: HEADER                           *
  16 @c %  *************************************************************************/
  17 @c %**start of header
  18 @setfilename openscop.info
  19 @settitle OpenScop Specification and Library
  20
  21 @set EDITION 1.0
  22 @set SPEC_VERSION 1.1
  23 @set LIB_VERSION 0.9.0
  24 @set UPDATED June 3rd 2014
  25 @setchapternewpage odd
  26
  27 @c % This is to ask for A4 instead of Letter size document.
  28 @iftex
  29      @afourpaper
  30 @end iftex
  31
  32 @c %**end of header
  33
  34 @c % /************************************************************************
  35 @c %  *                PART II: SUMMARY DESCRIPTION AND COPYRIGHT            *
  36 @c %  ************************************************************************/
  37
  38 @copying
  39 This document describes OpenScop, a specification of a file format and a set
  40 of data structures for polyhedral compilation tools to talk
  41 together. It also describes briefly the OpenScop Library version @value{LIB_VERSION},
  42 a Free Software that provides an example of OpenScop implementation.
  43
  44 It would be quite kind to refer at the present document in any publication that
  45 results from the use of the OpenScop Library:
  46
  47 @example
  48 @@TechReport@{Bas11,
  49 @ @ author =@ @ @ @ @ @ @{C\'edric Bastoul@},
  50 @ @ title =@ @ @ @ @ @ @ @{OpenScop: A Specification and a Library for Data
  51 @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ Exchange in Polyhedral Compilation Tools@},
  52 @ @ month =@ @ @ @ @ @ @ @{September@},
  53 @ @ year =@ @ @ @ @ @ @ @ 2011,
  54 @ @ institution = @{Paris-Sud University, France@}
  55 @}
  56 @end example
  57
  58 Copyright @copyright{} 2011 Paris-Sud University and INRIA.
  59
  60 @c quotation
  61 Permission is granted to copy, distribute and/or modify this document under
  62 the terms of the GNU Free Documentation License, Version 1.2 published by the
  63 Free Software Foundation; with no Invariant Sections, with no Front-Cover
  64 Texts, and with no Back-Cover Texts. To receive a copy of the
  65 GNU Free Documentation License, write to the Free Software Foundation, Inc.,
  66 59 Temple Place, Suite 330, Boston, MA  02111-1307 USA.
  67 @c end quotation
  68 @end copying
  69
  70 @c % /*************************************************************************
  71 @c %  *                 PART III: TITLEPAGE, CONTENTS, COPYRIGHT              *
  72 @c %  *************************************************************************/
  73 @titlepage
  74 @title OpenScop
  75 @subtitle A Specification and a Library for Data Exchange in Polyhedral Compilation Tools
  76 @subtitle Edition @value{EDITION}, for OpenScop Specification @value{SPEC_VERSION} and OpenScop Library @value{LIB_VERSION}
  77 @subtitle @value{UPDATED}
  78 @author C@'edric Bastoul
  79
  80 @c The following two commands start the copyright page.
  81 @page
  82 @vskip 0pt plus 1filll
  83 @insertcopying
  84 @end titlepage
  85
  86 @c Output the table of contents at the beginning.
  87 @contents
  88
  89 @c % /*************************************************************************
  90 @c %  *                     PART IV: TOP NODE AND MASTER MENU                 *
  91 @c %  *************************************************************************/
  92 @ifnottex
  93 @node Top
  94 @top OpenSCop
  95
  96 @insertcopying
  97 @end ifnottex
  98
  99 @menu
 100 * Introduction::
 101 * Polyhedral Representation::
 102 * OpenScop Specification::
 103 * OpenScop Library::
 104 * References::
 105 @end menu
 106
 107 @c % /*************************************************************************
 108 @c %  *                       PART V: BODY OF THE DOCUMENT                    *
 109 @c %  *************************************************************************/
 110
 111 @c %  ****************************** INTRODUCTION ******************************
 112 @node Introduction
 113 @chapter Introduction
 114 OpenScop is an open specification that defines a file format and a set of
 115 data structures to represent a @emph{static control part} (SCoP for short),
 116 i.e., a program part that can be represented in the @emph{polyhedral model}.
 117 The goal of OpenScop is to provide a common interface to various
 118 polyhedral compilation tools in order to simplify their interaction.
 119
 120 Designing a single format for tools that have different purposes
 121 (e.g., as different as code generation and data dependence analysis) may
 122 sound strange at first. However we could observe that most available
 123 polyhedral compilation tools during the last decade were manipulating
 124 more or less the same kind of data (polyhedra, affine functions...) and
 125 were actually sharing a part of their input (e.g., iteration domains and
 126 context concepts are nearly everywhere). We could also observe that
 127 those tools may rely on different internal representations, mostly based on
 128 one of the major polyhedral libraries (e.g., Polylib, PPL or isl), and
 129 this representation may change over time (e.g., when switching to a
 130 more convenient polyhedral library).
 131 The OpenScop aim is to provide a stable, unified format that offers a
 132 durable guarantee that a tool can use an output or provide an input to
 133 another tool without breaking a tool chain because of some internal
 134 changes in one element of the chain. The other promise of OpenScop is
 135 the ability to assemble or replace the basic blocks of a polyhedral
 136 compilation framework at no, or at least low engineering cost.
 137
 138 The policy that drives OpenScop can be summarized by these three rules:
 139 @itemize @bullet
 140 @item  Embed the @emph{minimum} information to build a complete polyhedral
 141        compilation framework in the so-called @emph{core part}
 142        (to avoid as much as possible empty or useless information
 143        for each tool).
 144 @item  Provide a @emph{very stable} core part (so users have some
 145        guarantee that they will not need to update their tool
 146        because of frequent specification evolution),
 147 @item  Provide a @emph{very flexible} extension part (so it can also
 148        be used to test wild new ideas).
 149 @end itemize
 150
 151 @noindent Another, more technical, rule may be added:
 152 @itemize @bullet
 153 @item  Avoid any need for external library or tool to support it
 154        (i.e., it's not XML or YAML or anything like that).
 155 @end itemize
 156
 157 The success of OpenScop in meeting its goals totally depends on its
 158 acceptance by the tool developers (that have to support it in their tools).
 159 To help them, we provide an example implementation: the OpenScop Library.
 160 This library (and in particular its API) is not part of the OpenScop
 161 specification (which includes only the file format and the set of data
 162 structures). It is licensed under the 3-clause BSD license so developers may
 163 feel free to use it in their code (either by linking it or copy-pasting its
 164 code). This document also describes this library briefly (readers are
 165 invited to read at its technical documentation).
 166 The current version of the OpenScop Library is still under evaluation,
 167 and there is no guarantee that the upward compatibility will be respected,
 168 even if we do think so. A lot of reports are
 169 necessary to freeze the library API. Thus you are very welcome and
 170 encouraged to send reports on bugs, wishes, critics, comments, suggestions
 171 or (please!) successful experiences to the OpenScop mailing list
 172 @email{openscop-development@@googlegroups.com}.
 173
 174 This document is organized as follows. First, we provide some
 175 background on the polyhedral model and how it is used to represent and to
 176 manipulate programs (@pxref{Polyhedral Representation}). Next,
 177 we describe the OpenScop specification, from the file format
 178 (@pxref{OpenScop File Format Specification}) to the data structures
 179 and the OpenScop Library API
 180 (@pxref{OpenScop Data Structure Specification}).
 181 Finally we will detail how to install the OpenScop Library
 182 (@pxref{Installation}).
 183
 184
 185 @c %  ******************* POLYHEDRAL REPRESENTATION OF PROGRAMS ****************
 186 @node Polyhedral Representation
 187 @chapter Polyhedral Representation of Programs
 188 If you are reading at the OpenScop documentation, you probably don't need any
 189 explanation about the polyhedral model. It is unlikely that someone will read
 190 this paper by mistake. However some vicious advisor may ask their poor
 191 engineers/interns/students
 192 to work for the very first time on this exciting topic. Most papers on
 193 polyhedral compilation are hard to read. Despite my efforts,
 194 mine are no exception according to some reviewers. Hence I give there a new
 195 try to provide a comprehensive explanation of the polyhedral model without the
 196 size and style limits of a classical research paper.
 197
 198 Be aware that to be able to understand the polyhedral model, there are a few
 199 prerequisites. You should not read the following while you still ignore
 200 what is:
 201 @itemize @bullet
 202 @item  a @code{for} loop construction in C programs (@code{do} loops in FORTRAN are OK too!),
 203 @item  an @emph{affine expression},
 204 @item  a @emph{vector},
 205 @item  a @emph{matrix},
 206 @item  a @emph{matrix-vector multiply}.
 207 @end itemize
 208 If you do not know those concepts, please do some search and come back
 209 afterwards. And if you are courageous enough, send me a few lines that
 210 describe them so I can insert them here!
 211
 212 @menu
 213 * Motivation::
 214 * Thinking in Polyhedra::
 215 * What's Next?::
 216 @end menu
 217
 218
 219 @node Motivation
 220 @section Motivation: Program Transformations
 221
 222 A direct translation of high level programs written, e.g., in C, to assembly
 223 then to object code is likely to produce (very) inefficient applications.
 224 Architectures are
 225 quite complex, including several levels of cache memory, many cores, deep
 226 pipelines, various number of functional units, of registers etc.
 227 The list of such
 228 "architectural features" is growing with each new generation of processors.
 229 To achieve the best performance, the object program must use
 230 these features in a smart way.
 231 Programmers use high level languages for productivity and portability:
 232 typically they do not have to take care of the target architecture but
 233 to ensure they write programs which produce the right output. Hence,
 234 the problem of mapping the program to the target architecture in the most
 235 efficient way is left to the compiler.
 236
 237 The compiler may see a high level program as a specification
 238 @emph{of an output}. The program is a list of instructions to be executed to
 239 produce the output. As long as the output is guaranteed to be as the
 240 programmer specified in his code, the compiler is free to modify
 241 the program.
 242 For instance, let us imagine we are working on an architecture with only
 243 three registers and we consider the following statements written by
 244 a programmer:
 245
 246 @example
 247 @group
 248 x = a + b;
 249 y = c + d;
 250 z = a * b;
 251 @end group
 252 @end example
 253
 254 It is easy to see that we can reorder the three statements in any way without
 255 modifying the semantics (no statement reads or writes a variable that another
 256 statement writes). Because of the lack of registers, the solutions such that
 257 the first and the third statements are one after the other are better
 258 because @code{a} and @code{b} will be put in the processor registers by
 259 one statement and can be reused directly by the other one
 260 without reading from memory (this is called a @emph{data locality
 261 improving} transformation). Hence a better statement order is, e.g.:
 262
 263 @example
 264 @group
 265 x = a + b;
 266 z = a * b; // a and b are still in processor registers
 267 y = c + d;
 268 @end group
 269 @end example
 270
 271 We can also notice that it is possible to run the three statements in
 272 parallel (possibly on different processors). The programmer may
 273 explicit this in a way the compiler
 274 and/or the architecture is able to understand. For instance,
 275 we can use OpenMP to describe parallelism
 276 (this is called a @emph{parallelizing} transformation):
 277
 278 @example
 279 @group
 280 #pragma omp parallel sections
 281 @{
 282    #pragma omp section
 283    @{
 284      x = a + b;
 285    @}
 286    #pragma omp section
 287    @{
 288      y = c + d;
 289    @}
 290    #pragma omp section
 291    @{
 292      z = a * b;
 293    @}
 294 @}
 295 @end group
 296 @end example
 297
 298 However, the right way to optimize this program is probably a tradeoff
 299 between these two techniques. This is true if, e.g., the target
 300 architecture has some limitations to run too many operations in parallel,
 301 or, like in our case, when some data may be reused by some processors.
 302 Hence, the best optimization for our small example is probably the
 303 following:
 304
 305 @example
 306 @group
 307 #pragma omp parallel sections
 308 @{
 309    #pragma omp section
 310    @{
 311      x = a + b;
 312      z = a * b;
 313    @}
 314    #pragma omp section
 315    @{
 316      y = c + d;
 317    @}
 318 @}
 319 @end group
 320 @end example
 321
 322 This example is quite trivial because the statements are
 323 executed only once. The real sport begins when we have to deal with loops,
 324 as we will see momentarily. However, polyhedral compilation framework users
 325 have to be conscious that we @emph{need} to transform programs to achieve
 326 the best performance and that the best transformation  that has to be
 327 discovered (at the price of many, many efforts) and performed may be
 328 quite complex. Hence the need of powerful model and tools.
 329
 330
 331 @node Thinking in Polyhedra
 332 @section Thinking in Polyhedra
 333
 334
 335 Since the very first compilers, the internal representation of programs
 336 is the @emph{Abstract Syntax Tree}, or AST. In such representation,
 337 each statement appears only once even if it is executed many times (e.g.,
 338 when it is enclosed inside a loop). This is a limitation
 339 for finding and applying complex transformations:
 340 @itemize @bullet
 341 @item It limits program analysis power. For instance if a statement
 342       @emph{depends} on another statement (i.e., they access the same
 343       memory location and at least one of these accesses is a write),
 344       we will consider both statements as unique entities while the
 345       dependence relation may involve only few statement executions.
 346 @item It limits program transformation power. Loop transformations
 347       operate on statement executions. For instance, because they
 348       consider all statement executions at the same time, present day
 349       production compilers are not able to achieve loop fusion
 350       (that tries to merge the loop bodies of two loops) if the loop bounds
 351       of the two loops do not match (yes, that's ridiculous).
 352 @item It limits program manipulation flexibility.
 353       Trees are very rigid data structures that are not easy to manipulate.
 354       Program transformation may require very complex transformations that will
 355       imply deep modifications of the control flow.
 356 @end itemize
 357
 358 The Polyhedral Model is a convenient alternative representation which
 359 combines analysis power, expressiveness and high flexibility. The drawback
 360 is it breaks the classical structure of programs that every programmer
 361 is familiar with. It requires some (real) efforts
 362 to be smoothly manipulated, but it definitely worth it. It is based on three
 363 main concepts, @emph{iteration domain},  @emph{scattering function} and
 364 @emph{access function} that are described in depth in the
 365 following sections.
 366
 367 A program part that can be represented using the Polyhedral Model is called
 368 a @strong{Static Control Part} or @strong{SCoP} for short.
 369
 370 @menu
 371 * Iteration Domain::
 372 * Scattering Function::
 373 * Access Function::
 374 @end menu
 375
 376 @node Iteration Domain
 377 @subsection Iteration Domain
 378
 379 The key aspect of the polyhedral model is to consider @emph{statement
 380 instances}. A statement instance is @emph{one} execution of a statement.
 381 A statement
 382 outside a loop has only one instance while those inside loops may have many.
 383 Let us consider the following code with two statements @code{S1}
 384 and @code{S2}:
 385
 386 @example
 387 @group
 388 pi = 3.14;               // S1
 389 for (i = 0; i < 5; i++)
 390   A[i] = pi;             // S2
 391 @end group
 392 @end example
 393
 394 The list of statement instances is the following (we just have to fully
 395 unroll the loop):
 396
 397 @example
 398 @group
 399 pi = 3.14;
 400 A[0] = pi;
 401 A[1] = pi;
 402 A[2] = pi;
 403 A[3] = pi;
 404 A[4] = pi;
 405 @end group
 406 @end example
 407
 408 Each instance of a statement which is enclosed inside a loop may be referred
 409 thanks to its outer loop counters (or @emph{iterators}). In the polyhedral
 410 model we consider statements as functions of the outer loop counters that may
 411 produce statement instances:
 412 instead of simply "@code{S2}", we use preferably the notation @code{S2(i)}.
 413 For instance we  denote the statement instance @code{A[3] = pi;} of the
 414 previous example as @code{S2(3)}. This means @emph{instance of
 415 statement @code{S2} for} @code{i = 3}.
 416 If a statement @code{S3} is enclosed inside two loops of iterators @code{i}
 417 (outermost loop) and @code{j} (innermost loop), we would denote it
 418 @code{S3(i,j)}, and so on with more enclosing loops.
 419
 420 The ordered list
 421 of iterators (ordered from the outermost iterator to the innermost iterator)
 422 is called the @strong{iteration vector}. For instance the iteration vector for
 423 @code{S3} is @code{(i,j)}, for @code{S2} it is @code{(i)}, and for @code{S1}
 424 it is empty since it has no enclosing loop: @code{()}. A more precise reading
 425 at the notation @code{S2(3)} would show that it denotes the instance of
 426 statement @code{S2} for the iteration vector @code{(2)}.
 427
 428 Obviously, dealing with statement instances does not mean we have to unroll all
 429 loops. First because there would be probably too many instances to deal with,
 430 and second because we probably just do not know how many instances there are.
 431 For instance in the following loop it is impossible to know (at compile time)
 432 how many times the statement @code{S3} will be executed:
 433
 434 @example
 435 @group
 436 for (i = 2; i <= N; i++)
 437   for (j = 2; j <= N; j++)
 438     A[i] = pi;             // S3
 439 @end group
 440 @end example
 441
 442 @noindent Such a loop is said to be @emph{parametric}: it depends on
 443 (at least) a value called a @emph{parameter} which is not modified
 444 during the execution of the whole loop, but is unknown at compile time.
 445 Here, the only parameter is @code{N}.
 446
 447 A compact way to represent all the instances of a given statement
 448 is to consider the set of all possible values of its iteration vector.
 449 This set is called the @strong{iteration domain}. It can be conveniently
 450 described thanks to all the constraints on the various iterators the statement
 451 depends on. For instance, let us consider
 452 the statement @code{S3} of the previous program. The iteration domain is the set
 453 of iteration vectors @code{(i,j)}. Because of the parameter, we are not able to
 454 achieve a precise list of all possible values. It would look like this:
 455
 456 @example
 457 @group
 458 (2,2)  (2,3)  (2,4)  ...  (2,N)
 459 (3,2)  (3,3)  (3,4)  ...  (3,N)
 460 ...    ...    ...    ...  ...
 461 (N,2)  (N,3)  (N,4)  ...  (N,N)
 462 @end group
 463 @end example
 464
 465 @noindent A better way is to say it is the set
 466 of iteration vectors @code{(i,j)} such that @code{i} is an integer greater or
 467 equal than 2 and lower or equal than @code{N}, and @code{j} is an integer
 468 greater or equal than 2 and lower or equal than @code{N}. This may be written
 469 in the following mathematical form:
 470
 471 @tex
 472 $$D _{S3} =  \{(i,j) \in Z^2 \; | \; 2 \leq i \leq N \land 2 \leq j \leq N \}$$
 473 @end tex
 474
 475 @ifnottex
 476 @example
 477 @group
 478 D_S3 =  @{(i,j) in Z^2 | 2 <= i <= N && 2 <= j <= N @}
 479 @end group
 480 @end example
 481 @end ifnottex
 482
 483 @noindent It is easy to see that this iteration domain is a part of the
 484 2-dimensional space
 485 @tex
 486 $Z^2$.
 487 @end tex
 488 @ifnottex
 489 @example
 490 @group
 491 Z^2.
 492 @end group
 493 @end example
 494 @end ifnottex
 495 We often use in our research papers a graphical representation that gives a
 496 better view of this subspace:
 497
 498 @image{images/basic1,4cm}
 499
 500 @noindent Here, the iteration domain is specified thanks to a set of
 501 constraints. When those constraints are affine and
 502 depend only on the outer loop counters and some parameters, the set of
 503 constraints defines a @emph{polyhedron} (more precisely this is a
 504 @emph{Z-polyhedron}, but we use @emph{polyhedron} for short).
 505 Hence the @emph{polyhedral model}!
 506
 507 To manipulate a set of affine constraints easily, we rely on a matrix
 508 representation. To write it, we use the @emph{homogeneous} iteration vector:
 509 it is simply the iteration vector with some additional dimensions to
 510 represent the parameters and the constant.
 511 For instance for the statement @code{S3}, the
 512 iteration vector in homogeneous coordinates is @code{(i, j, N, 1)}
 513 (we will now call it @emph{iteration vector} directly for short).
 514 Then we write all the constraints as affine inequalities of the form
 515 @emph{affine constraint}
 516 @tex
 517 $\geq 0$.
 518 @end tex
 519 @ifnottex
 520 @code{ >= 0}.
 521 @end ifnottex
 522 For instance for the statement @code{S3} the set of constraints is:
 523
 524 @tex
 525 $$
 526 \hbox{$ \cases{  i         - 2  &$\geq 0$\cr
 527                 -i     + N      &$\geq 0$\cr
 528                      j     - 2  &$\geq 0$\cr
 529                     -j + N      &$\geq 0$}$}
 530 $$
 531 @end tex
 532
 533 @ifnottex
 534 @example
 535 @group
 536     i - 2 >= 0
 537    -i + N >= 0
 538     j - 2 >= 0
 539    -j + N >= 0
 540 @end group
 541 @end example
 542 @end ifnottex
 543
 544 @noindent Lastly, we translate the constraint system to the form
 545 @strong{domain matrix}@code{ * }@emph{iteration vector}@code{ >= 0}:
 546
 547 @c Thanks to Harald Devos for the TeX :-) !
 548 @tex
 549 $$
 550 \left[
 551 \matrix{
 552  1 &  0  & 0  & -2 \cr
 553 -1 &  0  & 1  &  0 \cr
 554  0 &  1  & 0  & -2 \cr
 555  0 & -1  & 1  &  0
 556  }
 557 \right]
 558 \left(
 559 \matrix{
 560 i \cr
 561 j \cr
 562 N \cr
 563 1
 564  }
 565 \right)
 566 \ge
 567 \left(
 568 \matrix{
 569 0 \cr
 570 0 \cr
 571 0 \cr
 572 0
 573 }
 574 \right)
 575 $$
 576 @end tex
 577 @ifnottex
 578 @example
 579 @group
 580 [  1  0  0 -2 ]   [ i ]    [ 0 ]
 581 [ -1  0  1  0 ]   [ j ]    [ 0 ]
 582 [  0  1  0 -2 ] * [ N ] >= [ 0 ]
 583 [  0 -1  1  0 ]   [ 1 ]    [ 0 ]
 584 @end group
 585 @end example
 586 @end ifnottex
 587
 588 @noindent @strong{The domain matrix will be used in all our tools to provide the
 589 information on the iteration domain of a given statement (the iteration vector
 590 is in general an implicit information).}
 591
 592 @node Scattering Function
 593 @subsection Scattering Function
 594
 595 There is no ordering information inside the iteration domain: it only describes
 596 the set of statement instances but @strong{not} the order in which they have
 597 to be executed relatively to each other. In the past the lexicographic order
 598 of the iteration domain was considered, this is no more true
 599 (especially when using CLooG). If we do not provide any ordering information, this
 600 means that the statement instances may be executed in any order
 601 (this is useful, e.g., to specify parallelism). However, some statement instances
 602 may depend on some others and it may be critical to enforce a given order (or
 603 non-order). Hence we need another information.
 604
 605 We call @emph{scattering} any kind of ordering information in the polyhedral
 606 model. There exists many kind of ordering, such as @emph{allocation},
 607 @emph{scheduling}, @emph{chunking} etc. They are all expressed
 608 in the same way, i.e., using @emph{logical stamps}, but they may have
 609 different semantics.
 610
 611 In the case of @strong{scheduling}, the logical stamps are logical dates that
 612 express at which date a statement instance has to be executed. For instance,
 613 let us consider the following three statements:
 614
 615 @example
 616 @group
 617 x = a + b;   // S1
 618 y = c + d;   // S2
 619 z = a * b;   // S3
 620 @end group
 621 @end example
 622
 623 @noindent The scheduling of a statement @code{S} is typically
 624 denoted by
 625 @tex
 626 $\theta_{S}$.
 627 @end tex
 628 @ifnottex
 629 T_S.
 630 @end ifnottex
 631 Let us consider the following logical dates for each statement:
 632
 633 @example
 634 @group
 635 @tex
 636 $\theta_{S1} = 2$
 637 $\theta_{S2} = 3$
 638 $\theta_{S3} = 1$
 639 @end tex
 640 @end group
 641 @end example
 642
 643 @ifnottex
 644 @example
 645 @group
 646 T_S1 = 1
 647 T_S2 = 2
 648 T_S3 = 3
 649 @end group
 650 @end example
 651 @end ifnottex
 652
 653 @noindent It means that statement @code{S3} has to be executed at logical date
 654 @code{1}, statement @code{S1} has to be executed at logical date
 655 @code{2} and statement @code{S2} has to be executed at logical date
 656 @code{3}. The target code has to respect this scheduling (the order of
 657 the logical dates), hence it would look like the following where the
 658 variable @code{t} denotes the time:
 659
 660 @example
 661 @group
 662 t = 1;
 663 z = a * b;   // S3
 664 t = 2;
 665 x = a + b;   // S1
 666 t = 3;
 667 y = c + d;   // S2
 668 @end group
 669 @end example
 670
 671 @noindent When some statements share the same logical date, this means that,
 672 once the program reaches this logical date, the two statements can be executed
 673 in any order, or better, in parallel. For instance let us consider the
 674 following scheduling:
 675
 676 @example
 677 @group
 678 @tex
 679 $\theta_{S1} = 1$
 680 $\theta_{S2} = 2$
 681 $\theta_{S3} = 1$
 682 @end tex
 683 @end group
 684 @end example
 685
 686 @ifnottex
 687 @example
 688 @group
 689 T_S1 = 1
 690 T_S2 = 2
 691 T_S3 = 1
 692 @end group
 693 @end example
 694 @end ifnottex
 695
 696 @noindent Statements @code{S1} and @code{S3} have the same logical date,
 697 moreover, @code{S2} has a greater logical date than @code{S1} and @code{S3}.
 698 Hence the target code would be:
 699
 700 @example
 701 @group
 702 t = 1;
 703 #pragma omp parallel sections
 704 @{
 705    #pragma omp section
 706    @{
 707      x = a + b;   // S1
 708    @}
 709    #pragma omp section
 710    @{
 711      z = a * b;   // S3
 712    @}
 713 @}
 714 t = 2;
 715 y = c + d;        // S2
 716 @end group
 717 @end example
 718
 719 Logical dates may be multidimensional, as clocks: the first dimension
 720 may correspond to days (most significant), the next one to hours (less
 721 significant), the third to minutes and so on. For instance we can consider
 722 the following multidimensional schedules for our example:
 723
 724 @example
 725 @group
 726 @tex
 727 $\theta_{S1} = (1,1)$
 728 $\theta_{S2} = (2,1)$
 729 $\theta_{S3} = (1,2)$
 730 @end tex
 731 @end group
 732 @end example
 733
 734 @ifnottex
 735 @example
 736 @group
 737 T_S1 = (1,1)
 738 T_S2 = (2,1)
 739 T_S3 = (1,2)
 740 @end group
 741 @end example
 742 @end ifnottex
 743
 744 @noindent It is not very hard to decypher the meaning of such scheduling.
 745 Because of the first dimension, statements @code{S1} and @code{S3} will be
 746 executed before statement @code{S2} (@code{S1} and @code{S3} are executed at
 747 day 1, while @code{S2} is executed at day 2). The second dimension is not
 748 really useful there for @code{S2} because it is the only statement executed
 749 at day 2. Nevertheless it allows to order @code{S1} and
 750 @code{S3} relatively to each other since @code{S1} is executed at hour 1 of day
 751 1 while @code{S3} is executed at hour 2 of day 1.
 752 The corresponding target code is the following, with some
 753 additional time variables for a better view of the ordering (@code{t1}
 754 corresponds to the first time dimension, @code{t2} to the second one):
 755
 756 @example
 757 @group
 758 t1 = 1;
 759 t2 = 1;
 760 x = a + b;   // S1
 761 t2 = 2;
 762 z = a * b;   // S3
 763 t1 = 2;
 764 t2 = 1;
 765 y = c + d;   // S2
 766 @end group
 767 @end example
 768
 769 In the case of @strong{allocation} (in the literature we can find some
 770 papers calling it @emph{placement}), the logical stamp is a processor
 771 number expressing on which processor a statement instance has to be
 772 executed. Typically, allocations are written in the same way as scheduling.
 773 Here, we denote it @math{P_S} for a statement @code{S}.
 774 For instance, let us consider the following allocation:
 775
 776 @example
 777 @group
 778 @tex
 779 $P_{S1} = 1$
 780 $P_{S2} = 2$
 781 $P_{S3} = 1$
 782 @end tex
 783 @end group
 784 @end example
 785
 786 @ifnottex
 787 @example
 788 @group
 789 P_S1 = 1
 790 P_S2 = 2
 791 P_S3 = 1
 792 @end group
 793 @end example
 794 @end ifnottex
 795
 796 @noindent The corresponding target code has to take into account that both
 797 statements @code{S1} and @code{S3} have to be executed on the same processor
 798 (they have the same logical number 1) and that statement @code{S2} has
 799 to be executed on another processor (logical number 2). A possible target code
 800 is the following:
 801
 802 @example
 803 @group
 804 #pragma omp parallel sections
 805 @{
 806    #pragma omp section
 807    @{
 808      // Logical processor 1
 809      x = a + b;   // S1
 810      z = a * b;   // S3
 811    @}
 812    #pragma omp section
 813    @{
 814      // Logical processor 2
 815      y = c + d;   // S2
 816    @}
 817 @}
 818 @end group
 819 @end example
 820
 821 @noindent We can note that no order has been specified for the
 822 statements @code{S1} and @code{S3} that are executed on the same processor.
 823 Hence any order is satisfying. For sake of flexibility, it is usual to build
 824 a scattering whose various dimensions do not have the same semantics. A
 825 typical construction is @emph{space/time mapping} where the first @code{n}
 826 dimensions are devoted to allocation, then the last @code{m}
 827 dimensions are devoted to scheduling. Typically, space/time mapping is
 828 written in the same way as scheduling.
 829 Here we denote it @math{M_S} for a statement @code{S}.
 830 For instance, let us consider the following space/time mapping for our
 831 example where one dimension is devoted to mapping and one dimension is
 832 devoted to scheduling:
 833
 834 @example
 835 @group
 836 @tex
 837 $M_{S1} = (1,2)$
 838 $M_{S2} = (2,1)$
 839 $M_{S3} = (1,1)$
 840 @end tex
 841 @end group
 842 @end example
 843
 844 @ifnottex
 845 @example
 846 @group
 847 M_S1 = (1,2)
 848 M_S2 = (2,1)
 849 M_S3 = (1,1)
 850 @end group
 851 @end example
 852 @end ifnottex
 853
 854 @noindent Here we have the same first dimension as the previous example, thus
 855 the allocation of the statements to processors is the same. The second
 856 dimension precises on a given processor at which logical date a statement
 857 instance has to be executed. Here, the statement @code{S1} is executed at
 858 day 2 on processor 1 while the statement @code{S3} is executed at day 1 onto
 859 the same processor. It follows this space/time mapping corresponds to the
 860 following target code (we added an additional variable to represent the
 861 local logical clocks):
 862
 863 @example
 864 @group
 865 #pragma omp parallel sections
 866 @{
 867    #pragma omp section
 868    @{
 869      // Logical processor 1
 870      t = 1;
 871      z = a * b;   // S3
 872      t = 2;
 873      x = a + b;   // S1
 874    @}
 875    #pragma omp section
 876    @{
 877      // Logical processor 2
 878      t = 1;
 879      y = c + d;   // S2
 880    @}
 881 @}
 882 @end group
 883 @end example
 884
 885 For the same reason as discussed for iteration domains
 886 (@pxref{Iteration Domain}), it is not possible to define a scattering for
 887 each statement instance, especially if the statement belongs to a
 888 (possibly parametric) loop. The iteration vector fully defines an
 889 instance of a given statement. Thus, a practical way to provide a scattering
 890 for each instance of a given statement is to use a @emph{function}
 891 that depends on the iteration vector. In this way the function may associate
 892 to each iteration vector a different scattering. We call these functions
 893 @strong{scattering functions}. Scattering functions are @emph{affine}
 894 functions of the outer loop counter and the global parameters.
 895 For instance, let us consider the following source code:
 896
 897 @example
 898 @group
 899 for (i = 2; i <= 4; i++)
 900   for (j = 2; j <= 4; j++)
 901     P[i+j] += A[i] + B[j]; // S4
 902 @end group
 903 @end example
 904
 905 @noindent The iteration domain of the statement @code{S4} is:
 906
 907
 908 @tex
 909 $$D _{S4} =  \{(i,j) \in Z^2 \; | \; 2 \leq i \leq 4 \land 2 \leq j \leq 4 \}.$$
 910 @end tex
 911 @ifnottex
 912 @example
 913 @group
 914 D_S4=  @{(i,j) in Z^2 | 2 <= i <= 4 && 2 <= j <= 4 @}.
 915 @end group
 916 @end example
 917 @end ifnottex
 918
 919 @noindent If you are still not comfortable with the mathematical notation, it
 920 corresponds to the following graphical representation:
 921
 922 @image{images/basic2,3cm}
 923
 924 @noindent The list of the statement instances of @code{S4} (the integer
 925 points of its iteration domain) corresponds to the following iteration vectors:
 926
 927 @example
 928 @group
 929 iteration vector
 930      (2,2)
 931      (2,3)
 932      (2,4)
 933      (3,2)
 934      (3,3)
 935      (3,4)
 936      (4,2)
 937      (4,3)
 938      (4,4)
 939 @end group
 940 @end example
 941
 942 @noindent Let us suppose we want to schedule the instances of the statement
 943 @code{S4} (the integer points of its iteration domain) using the following
 944 scheduling function:
 945
 946 @example
 947 @group
 948 @tex
 949 $\theta_{S4}(i, j) = (j+2, 3*i+j)$
 950 @end tex
 951 @end group
 952 @end example
 953
 954 @ifnottex
 955 @example
 956 @group
 957 T_S4(i,j) = (j+2,3*i+j)
 958 @end group
 959 @end example
 960 @end ifnottex
 961
 962 @noindent We only need to apply the function to each iteration vector to find
 963 the logical date of each instance:
 964
 965 @example
 966 @group
 967 iteration vector       logical date
 968      (2,2)       -->       (4,8)
 969      (2,3)       -->       (5,9)
 970      (2,4)       -->       (6,10)
 971      (3,2)       -->       (4,11)
 972      (3,3)       -->       (5,12)
 973      (3,4)       -->       (6,13)
 974      (4,2)       -->       (4,14)
 975      (4,3)       -->       (5,15)
 976      (4,4)       -->       (6,16)
 977 @end group
 978 @end example
 979
 980 The polyhedral model users do not have to take care about the generation of a
 981 target code that respects the scattering: the
 982 CLooG@footnote{@url{http://www.cloog.org}} tool is there to
 983 solve the problem quite easily. For the previous
 984 example, the target code would be the following (@code{t1} and @code{t2}
 985 correspond to the two dimensions of the logical date, the reader may
 986 take care that this code actually implements the scattering function):
 987
 988 @example
 989 @group
 990 for (t1 = 4; t1 <= 6; t1++) @{
 991   for (t2 = t1+4; t2 <= t1+10; t2++) @{
 992     if ((-t1+t2+2)%3 == 0) @{
 993       i = (-t1+t2+2)/3 ;
 994       j = t1-2 ;
 995       P[i+j] += A[i] + B[j]; // S4
 996     @}
 997   @}
 998 @}
 999 @end group
1000 @end example
1001
1002
1003
1004 Obviously with such a twisted scheduling, it is hard to see the "meaning"
1005 of the transformation. To name any kind of program transformation as
1006 a magic spell ("tile", "fuse", "skew"...) is an old bad habit which is not
1007 relevant anymore in the polyhedral model: a scheduling may be an arbitrary
1008 complex sequence of basic-old-good transformations. Nevertheless it is most
1009 of the time quite easy to translate well known transformations to schedules.
1010 For instance, let us consider this new scheduling function:
1011
1012 @example
1013 @group
1014 @tex
1015 $\theta_{S4}(i,j) = (j,i)$
1016 @end tex
1017 @end group
1018 @end example
1019
1020 @ifnottex
1021 @example
1022 @group
1023 T_S4(i,j) = (j,i)
1024 @end group
1025 @end example
1026 @end ifnottex
1027
1028 @noindent Using CLooG, we can generate the target code:
1029
1030 @example
1031 @group
1032 for (t1 = 2; t1 <= 4; t1++) @{
1033   for (t2 = 2; t2 <= 4; t2++) @{
1034     i = t2;
1035     j = t1;
1036     P[i+j] += A[i] + B[j]; // S4
1037   @}
1038 @}
1039 @end group
1040 @end example
1041
1042
1043 @noindent It is easy to see (and analyze) that it corresponds to a classical
1044 @emph{loop interchange} transformation.
1045
1046 A very useful example of multi-dimensional scattering functions is
1047 the @strong{scheduling of the original program}.
1048 The method to compute it is quite simple (@pxref{Fea92}). The idea is to
1049 build an abstract syntax tree of the program and to read the scheduling for
1050 each statement. For instance, let us consider the following implementation of
1051 a Cholesky factorization:
1052
1053 @example
1054 @group
1055 /* A Cholesky factorization kernel. */
1056 for (i=1;i<=N;i++) @{
1057   for (j=1;j<=i-1;j++) @{
1058     a[i][i] -= a[i][j] ;           // S1
1059   @}
1060   a[i][i] = sqrt(a[i][i]) ;        // S2
1061   for (j=i+1;j<=N;j++) @{
1062     for (k=1;k<=i-1;k++) @{
1063       a[j][i] -= a[j][k]*a[i][k] ; // S3
1064     @}
1065     a[j][i] /= a[i][i] ;           // S4
1066     @}
1067   @}
1068 @}
1069 @end group
1070 @end example
1071
1072 @noindent The corresponding abstract syntax tree is shown in the following
1073 figure. It directly gives the scattering functions (schedules) for all the
1074 statements of the program (just follow the paths).
1075
1076 @image{images/tree,6cm}
1077
1078 @tex
1079 $$
1080 \hbox{$ \cases{ \theta _{S1}(i,j)    &$=  (0,i,0,j,0)$\cr
1081                 \theta _{S2}(i)      &$=  (0,i,1)$\cr
1082                 \theta _{S3}(i,j,k)  &$=  (0,i,2,j,0,k,0)$\cr
1083                 \theta _{S4}(i,j)    &$=  (0,i,2,j,1)$}$}
1084 $$
1085 @end tex
1086
1087 @ifnottex
1088 @example
1089 @group
1090 T_S1(i,j)   = (0,i,0,j,0)
1091 T_S2(i)     = (0,i,1)
1092 T_S3(i,j,k) = (0,i,2,j,0,k,0)
1093 T_S4(i,j)   = (0,i,2,j,1)
1094 @end group
1095 @end example
1096 @end ifnottex
1097
1098 @noindent These schedules depend on the iterators and give for each instance
1099 of each statement a unique execution date. Using such scattering functions
1100 allows CLooG to re-generate the input code.
1101
1102 @noindent To easily manipulate the scattering function of any
1103 statement @code{S}, we translate it to the matrix form:
1104 @tex
1105 @math{\theta_S}(@emph{iteration vector})
1106 @end tex
1107 @ifnottex
1108 T_S(@emph{iteration vector})
1109 @end ifnottex
1110 @code{ = }@strong{scattering matrix}@code{ * }@emph{iteration vector}.
1111 For instance let us consider again our previous example
1112 @tex
1113 $\theta_{S4}(i, j) = (j+2, 3*i+j).$
1114 @end tex
1115 @ifnottex
1116 T_S4(i,j) = (j+2,3*i+j).
1117 @end ifnottex
1118 We write it in the following way:
1119 @tex
1120 $$
1121 \theta_{S4}
1122 \left(
1123 \matrix{
1124  i \cr
1125  j \cr
1126  1
1127  }
1128 \right)
1129 =
1130 \left[
1131 \matrix{
1132  0 &  0  & 2 \cr
1133  3 &  1  & 0
1134  }
1135 \right]
1136 \left(
1137 \matrix{
1138  i \cr
1139  j \cr
1140  1
1141  }
1142 \right)
1143 $$
1144 @end tex
1145 @ifnottex
1146 @example
1147 @group
1148      [ i ]    [  0  1  2 ]   [ i ]
1149 T_S4([ j ]) = [  3  1  0 ] * [ j ]
1150      [ 1 ]                   [ 1 ]
1151 @end group
1152 @end example
1153 @end ifnottex
1154
1155 @noindent @strong{The scattering matrix will be used in all our tools to provide
1156 the information on the scattering of a given statement
1157 (the iteration vector is in general an implicit information).}
1158
1159
1160 @node Access Function
1161 @subsection Access Function
1162
1163 Before applying any transformation, it is essential to deeply analyze both the
1164 original program and the transformation to ensure the transformation does not
1165 imply any modification of the original program semantics. In the polyhedral
1166 model, we are able to achieve
1167 an exact analysis when all the memory accesses are made through arrays
1168 (note that variables are a particular case of arrays since they are simply
1169 arrays with only one memory location) with affine subscripts that depend
1170 on outer loop counters and global parameters (note that @emph{subscripts}
1171 are sometimes called @emph{index} or @emph{accesses} in the literature).
1172
1173 For instance let us consider the array access @code{A[2*i+j][j][i+N]}. It has
1174 three dimensions, each subscript dimension is an affine form of some outer loop
1175 iterarors (@code{i} and @code{j}) and global parameters (@code{N}) hence
1176 it corresponds to an acceptable array access to be analyzed in the
1177 polyhedral model.
1178
1179 Each array access can target a different memory cell depending on the
1180 statement instance, i.e., depending on the iteration vector.
1181 Thus we use access functions (or subscript functions, or index functions, as you
1182 prefer to call it) depending on the iteration vector to describe an array
1183 access. In our example, the access function would be written
1184 @math{F_A(i, j) = (2*i+j, j, i+N)}.
1185
1186 @noindent To easily manipulate the access function of any
1187 array @code{A}, we translate it to the matrix form:
1188 @math{F_A}(@emph{iteration vector})
1189 @code{ = }@strong{access matrix}@code{ * }@emph{iteration vector}.
1190 For instance let us consider again our previous example. We would
1191 write it in the following way:
1192 @tex
1193 $$
1194 F_A
1195 \left(
1196 \matrix{
1197  i \cr
1198  j \cr
1199  N \cr
1200  1
1201  }
1202 \right)
1203 =
1204 \left[
1205 \matrix{
1206  2 &  1  &  0 &  0 \cr
1207  0 &  1  &  0 &  0 \cr
1208  1 &  0  &  1 &  0
1209  }
1210 \right]
1211 \left(
1212 \matrix{
1213  i \cr
1214  j \cr
1215  N \cr
1216  1
1217  }
1218 \right)
1219 $$
1220 @end tex
1221 @ifnottex
1222 @example
1223 @group
1224     [ i ]    [  2  1  0  0 ]   [ i ]
1225 F_A([ j ]) = [  0  1  0  0 ] * [ j ]
1226     [ N ]    [  1  0  1  0 ]   [ N ]
1227     [ 1 ]                      [ 1 ]
1228 @end group
1229 @end example
1230 @end ifnottex
1231
1232 @noindent @strong{The access matrix will be used in all our tools to provide the
1233 information on the access of a given statement
1234 (the iteration vector is in general an implicit information).}
1235
1236 @node What's Next?
1237 @section What's Next?
1238
1239 OK, now you have an idea about how we do represent a program part in the
1240 polyhedral model. You know the three main concepts, namely: domain, scattering
1241 and access. What can you do with this?! Well, pretty much anything related
1242 to code restructuring! The core idea will be to rely on the mathematical
1243 representation to extract useful information about your
1244 code (data reuse, parallelism...) and to generate a scattering
1245 to benefit from the properties you analysed.
1246 However, OpenScop's documentation is not the right
1247 place to learn at this (OpenScop is all about representation, not about
1248 manipulation). Probably it is the right time for you to
1249 have a look at:
1250 @itemize @bullet
1251 @item PoCC @url{http://pocc.sourceforge.net}
1252 @item Pluto @url{http://pluto-compiler.sourceforge.net}
1253 @end itemize
1254
1255 @noindent Have fun :-) !
1256
1257
1258 @c %  *********************** OpenScop Specification **************************
1259 @node OpenScop Specification
1260 @chapter OpenScop Specification
1261
1262 OpenScop provides an explicit polyhedral representation of a static control
1263 part. It has been designed by various polyhedral compilation tool writers from
1264 various institutions. It builds on previous popular polyhedral file and data
1265 structure formats (such as @emph{.cloog} and CLooG data structures) to provide
1266 a unique, extensible format to most polyhedral compilation tools. It
1267 is composed of two parts. The first part, the so-called
1268 @emph{core part}, is devoted to the polyhedral representation of a SCoP.
1269 It contains what is strictly necessary to build a
1270 complete source-to-source framework in the polyhedral model and to output a
1271 semantically equivalent code for the SCoP, from analysis to code generation.
1272 The second part of the format, the so-called @emph{extension part},
1273 contains extensions to provide additional informations to some tools.
1274
1275 @menu
1276 * Preliminary Example::
1277 * OpenScop File Format Specification::
1278 * OpenScop Data Structure Specification::
1279 * Extensions::
1280 * History::
1281 @end menu
1282
1283 @c %/*************************************************************************
1284 @c % *                         PRELIMINARY EXAMPLE                           *
1285 @c % *************************************************************************/
1286 @node Preliminary Example
1287 @section Preliminary Example
1288 OpenScop is a specification for representing static control program parts in
1289 the polyhedral model. Thus, we can translate some code parts to an equivalent
1290 OpenScop representation. As an example, let us consider the
1291 following matrix-multiply kernel:
1292
1293 @example
1294 #pragma scop
1295 if (N > 0) @{
1296   for (i = 0; i < N; i++) @{
1297     for (j = 0; j < N; j++) @{
1298       C[i][j] = 0.0;
1299       for (k = 0; k < N; k++)
1300         C[i][j] = C[i][j] + A[i][k] * B[k][j];
1301     @}
1302   @}
1303 @}
1304 @end example
1305
1306 The Clan@footnote{@url{http://www.lri.fr/~bastoul/development/clan/}}
1307 tool may be used to translate this code part to an OpenScop
1308 representation automatically. The @code{#pragma scop} is used here for
1309 Clan, but some other tool may not need it. Here is the result of the
1310 translation to an OpenScop textual representation.
1311
1312 @sp 1
1313 @center @strong{DON'T PANIC}
1314 @sp 1
1315
1316 @noindent Explanations will follow and it is not
1317 as cryptic as it seems to be !
1318 @sp 1
1319
1320 @example
1321 <OpenScop>
1322
1323 # =============================================== Global
1324 # Backend Language
1325 C
1326
1327 # Context
1328 CONTEXT
1329 1 3 0 0 0 1
1330 # e/i | N  | 1
1331    1    1   -1    ## N-1 >= 0
1332
1333 # Parameter names are provided
1334 1
1335 # Parameter names
1336 <strings>
1337 N
1338 </strings>
1339
1340 # Number of statements
1341 2
1342
1343 # =============================================== Statement 1
1344 # Number of relations describing the statement
1345 3
1346
1347 # ----------------------------------------------  1.1 Domain
1348 DOMAIN
1349 4 5 2 0 0 1
1350 # e/i | i    j  | N  | 1
1351    1    1    0    0    0    ## i >= 0
1352    1   -1    0    1   -1    ## -i+N-1 >= 0
1353    1    0    1    0    0    ## j >= 0
1354    1    0   -1    1   -1    ## -j+N-1 >= 0
1355
1356 # ----------------------------------------------  1.2 Scattering
1357 SCATTERING
1358 5 10 5 2 0 1
1359 # e/i| s1   s2   s3   s4   s5  | i    j  | N  | 1
1360    0   -1    0    0    0    0    0    0    0    0    ## s1 = 0
1361    0    0   -1    0    0    0    1    0    0    0    ## s2 = i
1362    0    0    0   -1    0    0    0    0    0    0    ## s3 = 0
1363    0    0    0    0   -1    0    0    1    0    0    ## s4 = j
1364    0    0    0    0    0   -1    0    0    0    0    ## s5 = 0
1365
1366 # ----------------------------------------------  1.3 Access
1367 WRITE
1368 3 8 3 2 0 1
1369 # e/i| Arr  [1]  [2] | i    j  | N  | 1
1370    0   -1    0    0    0    0    0    1    ## C
1371    0    0   -1    0    1    0    0    0    ##  [i]
1372    0    0    0   -1    0    1    0    0    ##     [j]
1373
1374 # ----------------------------------------------  1.4 Statement Extensions
1375 # Number of Statement Extensions
1376 1
1377 <body>
1378 # Number of original iterators
1379 2
1380 # Original iterator names
1381 i j
1382 # Statement body expression
1383 C[i][j] = 0.0;
1384 </body>
1385
1386 # =============================================== Statement 2
1387 # Number of relations describing the statement
1388 5
1389
1390 # ----------------------------------------------  2.1 Domain
1391 DOMAIN
1392 6 6 3 0 0 1
1393 # e/i|  i    j    k  | N  | 1
1394    1    1    0    0    0    0    ## i >= 0
1395    1   -1    0    0    1   -1    ## -i+N-1 >= 0
1396    1    0    1    0    0    0    ## j >= 0
1397    1    0   -1    0    1   -1    ## -j+N-1 >= 0
1398    1    0    0    1    0    0    ## k >= 0
1399    1    0    0   -1    1   -1    ## -k+N-1 >= 0
1400
1401 # ----------------------------------------------  2.2 Scattering
1402 SCATTERING
1403 7 13 7 3 0 1
1404 # e/i| s1   s2   s3   s4   s5   s6   s7  | i    j    k  | N  | 1
1405    0   -1    0    0    0    0    0    0    0    0    0    0    0   ## s1 = 0
1406    0    0   -1    0    0    0    0    0    1    0    0    0    0   ## s2 = i
1407    0    0    0   -1    0    0    0    0    0    0    0    0    0   ## s3 = 0
1408    0    0    0    0   -1    0    0    0    0    1    0    0    0   ## s4 = j
1409    0    0    0    0    0   -1    0    0    0    0    0    0    1   ## s5 = 1
1410    0    0    0    0    0    0   -1    0    0    0    1    0    0   ## s6 = k
1411    0    0    0    0    0    0    0   -1    0    0    0    0    0   ## s7 = 0
1412
1413 # ----------------------------------------------  2.3 Access
1414 WRITE
1415 3 9 3 3 0 1
1416 # e/i| Arr  [1]  [2] | i    j    k  | N  | 1
1417    0   -1    0    0    0    0    0    0    1    ## C
1418    0    0   -1    0    1    0    0    0    0    ##  [i]
1419    0    0    0   -1    0    1    0    0    0    ##     [j]
1420
1421 READ
1422 3 9 3 3 0 1
1423 # e/i| Arr  [1]  [2] | i    j    k  | N  | 1
1424    0   -1    0    0    0    0    0    0    1    ## C
1425    0    0   -1    0    1    0    0    0    0    ##  [i]
1426    0    0    0   -1    0    1    0    0    0    ##     [j]
1427
1428 READ
1429 3 9 3 3 0 1
1430 # e/i| Arr  [1]  [2] | i    j    k  | N  | 1
1431    0   -1    0    0    0    0    0    0    2    ## A
1432    0    0   -1    0    1    0    0    0    0    ##  [i]
1433    0    0    0   -1    0    0    1    0    0    ##     [k]
1434
1435 READ
1436 3 9 3 3 0 1
1437 # e/i| Arr  [1]  [2] | i    j    k  | N  | 1
1438    0   -1    0    0    0    0    0    0    3    ## B
1439    0    0   -1    0    0    0    1    0    0    ##  [k]
1440    0    0    0   -1    0    1    0    0    0    ##     [j]
1441
1442 # ----------------------------------------------  1.4 Statement Extensions
1443 # Number of Statement Extensions
1444 1
1445 <body>
1446 # Number of original iterators
1447 3
1448 # Original iterator names
1449 i j k
1450 # Statement body expression
1451 C[i][j] = C[i][j] + A[i][k] * B[k][j];
1452 </body>
1453
1454 # =============================================== Extensions
1455 <comment>
1456 May the power of the polyhedral model be with you.
1457 </comment>
1458
1459 </OpenScop>
1460 @end example
1461
1462
1463 We will not describe here precisely the structure and the components of this
1464 output, this is described in depth in a further section
1465 (@pxref{OpenScop File Format Specification}). This format
1466 has been designed to be a possible input or output file format of most of
1467 polyhedral compilation tools. If you read the description of the polyhedral
1468 representation of programs, you should already feel familiar with this file
1469 format (@pxref{Polyhedral Representation}).
1470
1471
1472 @c %/*************************************************************************
1473 @c % *                       FILE FORMAT SPECIFICATION                       *
1474 @c % *************************************************************************/
1475 @node OpenScop File Format Specification
1476 @section OpenScop File Format Specification
1477
1478 @menu
1479 * Relations::
1480 * Generics::
1481 @end menu
1482
1483 The following grammar describes the structure of the
1484 OpenScop file format where terminals are preceeded by "_". Except
1485 stated otherwise, there can be at most one terminal per line in the file.
1486 Moreover, each line may finish with a comment, starting by the @samp{#}
1487 character. Each relevant part will be explained in more details momentarily:
1488
1489 @example
1490 OpenScop              ::= Start_tag Data End_tag
1491 Start_tag             ::= "<OpenScop>"
1492 End_tag               ::= "</OpenScop>"
1493 Data                  ::= Context Statements Extension_list
1494 Context               ::= Language Parameter_Domain Parameters
1495 Statements            ::= Nb_statements Statement_list
1496 Statement_list        ::= Statement Statement_list | (void)
1497 Relation_list         ::= _Relation Relation_list  | (void)
1498 Extension_list        ::= _Generic  Extension_list | (void)
1499 Statement             ::= Statement_relations Statement_extensions
1500 Statement_extensions  ::= Num_Extensions Extension_list
1501 Parameters            ::= "0" | "1" Parameter_information
1502 Statement_relations   ::= Nb_relations Relation_List
1503 Parameter_domain      ::= _Relation
1504 Language              ::= _String
1505 Nb_Relations          ::= _Integer
1506 Num_extensions        ::= _Integer
1507 Parameter_information ::= _Generic
1508 @end example
1509
1510 The @samp{Context} and the @samp{Statements} parts compose the
1511 @emph{core part}, i.e., what is strictly necessary to build
1512 a complete source to source framework based on OpenSCop:
1513 @itemize @bullet
1514 @item  @samp{Context} represents the global information of the SCoP. It
1515        consists on the target language, the global constraints on the
1516        parameters and optionally the parameter information which may be necessary
1517        for the code generation process. The constraints on the parameters
1518        are represented as a relation (@pxref{Context Domain Relation}).
1519        The parameter information is optional. It is preceded by a
1520        boolean which precises whether it is provided or not.
1521        It is a generic information (@pxref{Generics}), a @code{strings}
1522        (@pxref{Strings Generic}) for instance.
1523 @item  @samp{Statements} represents the information about the statements.
1524        @samp{Nb_statements} is the number of statements in the SCoP,
1525        i.e. the number of @samp{Statement} items in the @samp{Statement_list}.
1526        @samp{Statement} represents the information on a given statement.
1527        To each statement is associated a list of relations and,
1528        optionaly, a list of statement extensions. The list of relations
1529        may include one iteration domain (@pxref{Iteration Domain Relation}),
1530        one scattering relation (@pxref{Scattering Relation})
1531        and several access relations (@pxref{Access Relation}).
1532        There is no mandatory ordering, but for consistency reason it would
1533        be much appreciated that iteration domain comes first (if present)
1534        then scattering (if present), then accesses (if present).
1535        The statement extensions is an optional information. It starts with a
1536        integer which precises the number of extensions provided.
1537        It is generic information (@pxref{Generics}), a @code{body}
1538        (@pxref{Body Generic}) for instance.
1539 @end itemize
1540
1541 The @samp{Extension_list} represents the @emph{extension part} and may contain
1542 an arbitrary number of generic informations (@pxref{Generics}).
1543 Few examples of possible extensions are presented in a further
1544 section (@pxref{Extensions}).
1545
1546 As shown by the grammar, the input file describes the various pieces of
1547 information based on strings, integers, @emph{relations} and @emph{generics}.
1548 Relations and Generics are specific to OpenScop and are described in depth
1549 in the following Sections (@pxref{Relations} and @pxref{Generics}).
1550
1551
1552 @c %/*************************************************************************
1553 @c % *                              RELATIONS                                *
1554 @c % *************************************************************************/
1555 @node Relations
1556 @subsection Relations
1557
1558 @menu
1559 * Iteration Domain Relation::
1560 * Context Domain Relation::
1561 * Scattering Relation::
1562 * Access Relation::
1563 @end menu
1564
1565 @emph{Relations} are the essence of the OpenScop format and contain the
1566 "polyhedral" information. They are used to describe either an iteration
1567 domain, or a context domain, or a scattering or a memory access.
1568
1569 We use the relation term as a shortcut to denote a
1570 union of convex relations, each element of the union being described by a set of
1571 constraints in an extended PolyLib format (@pxref{Wil93}).
1572 The number of elements in the union is given by an integer on the first line,
1573 optionally followed by a comment starting with @samp{#}.
1574 This number of elements can be omitted when there is only one element.
1575 Each element in the union has the following syntax:
1576
1577 @enumerate
1578 @item Some optional comment lines beginning with @samp{#}.
1579 @item A line with the type of the relation, possibly followed by comments.
1580       The type can be one of the following:
1581       @itemize @bullet
1582       @item @code{UNDEFINED}: generic relation,
1583       @item @code{CONTEXT}: for context information,
1584       @item @code{DOMAIN}: for iteration domains,
1585       @item @code{SCATTERING}: for scattering relation,
1586       @item @code{READ}: for read accesses,
1587       @item @code{WRITE}: for write accesses,
1588       @item @code{MAY_WRITE}: for may-write accesses,
1589       @end itemize
1590 @item A line with 6 numbers, possibly followed by comments:
1591       @enumerate
1592       @item the number of rows of the constraint matrix,
1593       @item the number of columns of the constraint matrix,
1594       @item the number of @emph{output dimensions},
1595       @item the number of @emph{input dimension},
1596       @item the number of @emph{local dimensions}
1597             (existentially quantified dimensions),
1598       @item the number of @emph{parameters}.
1599       @end enumerate
1600       The sum of the last four numbers should be equal to the number of columns
1601       minus two. The remaining two columns are the equality/inequality
1602       indicator and the constant term. The number of parameters should be the
1603       same for all relations in the entire OpenScop file or data structure.
1604 @item The constraint rows. Each row corresponds to a constraint the
1605       relation has to satisfy. Each row must be on a single line and is possibly
1606       followed by comments. The constraint is an equality @math{p(x) = 0} if the
1607       first element is 0, an inequality  @math{p(x) \geq 0} if the first element
1608       is 1. The next elements are the coefficients of the output dimensions,
1609       followed by coefficients of the input dimensions, the existentially
1610       quantified dimensions and finally the parameters.
1611       The last element is the constant term.
1612 @end enumerate
1613
1614 This representation is the basis for several purposes. Examples for
1615 iteration domains (@pxref{Iteration Domain Relation}), context domains
1616 (@pxref{Context Domain Relation}), scattering
1617 relations (@pxref{Scattering Relation}) and
1618 access relations (@pxref{Access Relation}) are provided in further sections.
1619
1620 @c %/**************************  ITERATION DOMAIN  ****************************
1621 @node Iteration Domain Relation
1622 @subsubsection Iteration Domain Relation
1623
1624 Iteration domain represents the set of instances of the corresponding statement.
1625 OpenScop iteration domains are represented as relations with the following
1626 conventions:
1627 @itemize @bullet
1628 @item the type is @code{DOMAIN},
1629 @item there is 0 input dimension,
1630 @item loop iterators correspond to output dimensions.
1631 @end itemize
1632
1633 @noindent For instance, assuming that @samp{i}, @samp{j} and @samp{k} are the loop
1634 iterators and @samp{M} and @samp{N} are the parameters, the domain defined by
1635 the following constraints :
1636
1637 @tex
1638 $$
1639 \hbox{$ \cases{ -i     + M &$\geq 0$\cr
1640                     -j + N &$\geq 0$\cr
1641                  i + j - k &$\geq 0$}$}
1642 $$
1643 @end tex
1644 @ifnottex
1645 @example
1646 @group
1647    -i + M >= 0
1648    -j + N >= 0
1649 i + j - k >= 0
1650 @end group
1651 @end example
1652 @end ifnottex
1653
1654 @noindent can be written in the input file as follows:
1655
1656 @example
1657 @group
1658 # This is an iteration domain
1659 DOMAIN
1660 1 # Number of relations in the union
1661 3 7 3 0 0 2                  # 3 rows, 7 cols: 3 output dims and 2 params
1662 # e/i| i   j   k | M   N | 1
1663    1  -1   0   0   1   0   0 #    -i + M >= 0
1664    1   0  -1   0   0   1   0 #    -j + N >= 0
1665    1   1   1  -1   0   0   0 # i + j - k >= 0
1666 @end group
1667 @end example
1668
1669 @noindent Equivalently, it can be written in the following way as the number
1670 of relations in the union can be omitted if it is 1:
1671
1672 @example
1673 @group
1674 # This is an iteration domain
1675 DOMAIN
1676 3 7 3 0 0 2                  # 3 rows, 7 cols: 3 output dims and 2 params
1677 # e/i| i   j   k | M   N | 1
1678    1  -1   0   0   1   0   0 #    -i + M >= 0
1679    1   0  -1   0   0   1   0 #    -j + N >= 0
1680    1   1   1  -1   0   0   0 # i + j - k >= 0
1681 @end group
1682 @end example
1683
1684 @noindent As an example for unions, let us consider the following pseudo-code:
1685
1686 @example
1687 @group
1688 for (i = 1; i <= N; i++) @{
1689   if ((i >= M) || (i <= 2*M))
1690     S1(i);
1691 @}
1692 @end group
1693 @end example
1694
1695 @noindent The iteration domain of @samp{S1} can be divided into two
1696 relations and written in the OpenScop file as follows:
1697
1698 @example
1699 @group
1700 # This is an iteration domain
1701 DOMAIN
1702 2 # Number of relations in the union
1703 # Union part No.1
1704 3 5 1 0 0 2          # 3 rows, 5 cols: 1 output dim and 2 params
1705 # e/i| i | M   N | 1
1706    1   1   0   0  -1 #  i >= 1
1707    1  -1   0   1   0 #  i <= N
1708    1   1  -1   0   0 #  i >= M
1709 # Union part No.2
1710 3 5 1 0 0 2          # 3 rows, 5 cols: 1 output dim and 2 params
1711 # e/i| i | M   N | 1
1712    1   1   0   0  -1 #  i >= 1
1713    1  -1   0   1   0 #  i <= N
1714    1  -1   2   0   0 #  i <= 2*M
1715 @end group
1716 @end example
1717
1718 @noindent As an example for local dimensions (existentially quantified
1719 dimensions), let us consider the following pseudo-code:
1720
1721 @example
1722 @group
1723 for (i = 1; i <= N; i++) @{
1724   if ((i % 2) == 0)
1725     S1(i);
1726 @}
1727 @end group
1728 @end example
1729
1730 @noindent The iteration domain of @samp{S1} is composed of all even
1731 integer values between 1 and N. The "divisible by two" constraint can
1732 be expressed as follows: there exists an integer @samp{ld} such that
1733 @samp{i = 2*ld}. We encode this thanks to a new local dimension:
1734
1735 @example
1736 @group
1737 # This is an iteration domain
1738 DOMAIN
1739 3 5 1 0 1 1          # 3 rows, 5 cols: 1 output dim, 1 local dim, 1 param
1740 # e/i| i |ld | N | 1
1741    0   1  -2   0   0 #  i = 2*ld
1742    1   1   0   0   1 #  i >= 1
1743    1  -1   0   1   0 #  i <= N
1744 @end group
1745 @end example
1746
1747
1748 @c %/**************************  CONTEXT DOMAIN  ******************************
1749 @node Context Domain Relation
1750 @subsubsection Context Domain Relation
1751
1752 The context domain is a particular case of iteration domain
1753 (@pxref{Iteration Domain Relation}) where there are only
1754 constraints about parameters (no loop iterators). Hence it is the same
1755 as an iteration domain, with the following conventions:
1756 @itemize @bullet
1757 @item the type is @code{CONTEXT},
1758 @item there is 0 input dimension,
1759 @item there is 0 output dimension.
1760 @end itemize
1761
1762
1763 @c %/*************************  SCATTERING RELATION  **************************
1764 @node Scattering Relation
1765 @subsubsection Scattering Relation
1766
1767 Scattering relation maps an iteration domain to a logical time and/or
1768 space (and/or) anything.
1769 OpenScop scattering information is represented as relations
1770 (@pxref{Relations}) with the following conventions:
1771
1772 @itemize @bullet
1773 @item the type is @code{SCATTERING},
1774 @item output dimensions correspond to scattering dimensions,
1775 @item loop iterators correspond to input dimensions.
1776 @end itemize
1777
1778 As an example of a scattering relation and
1779 assuming that @samp{i}, @samp{j} and @samp{k} are the loop
1780 iterators and @samp{M} and @samp{N} are the parameters, take for instance:
1781 @tex
1782 $\theta_{S}(i,j,k) = (j+2,3*i+j,k+N+1).$
1783 @end tex
1784 @ifnottex
1785 @example
1786 T_@{S@}(i,j,k) = (j+2,3*i+j,k+N+1).
1787 @end example
1788 @end ifnottex
1789 We can represent it in the following way:
1790
1791 @example
1792 @group
1793 # A scattering relation
1794 SCATTERING
1795 # 3 rows, 10 columns: 3 scattering dimensions, 3 iterators, 2 parameters
1796 3 10 3 3 0 2
1797 # e/i|s1  s2  s3 | i   j   k | M   N | 1
1798    0  -1   0   0   0   1   0   0   0   2 # s1 = j+2
1799    0   0  -1   0   3   1   0   0   0   0 # s2 = 3*i+j
1800    0   0   0  -1   0   0   1   0   1   1 # s3 = k+N+1
1801 @end group
1802 @end example
1803
1804 @c %/**************************  ACCESS RELATION  *****************************
1805 @node Access Relation
1806 @subsubsection Access Relation
1807
1808 Access relation maps an iteration domain to an array space.
1809 Each array accessed in the SCoP has a unique identification number.
1810 OpenScop relation information is represented as relations
1811 (@pxref{Relations}) with the following conventions:
1812
1813 @itemize @bullet
1814 @item the type is one of the following:
1815       @itemize @bullet
1816       @item @code{READ}, for read accesses,
1817       @item @code{WRITE}, for write accesses,
1818       @item @code{MAY_WRITE}, for may write accesses,
1819       @end itemize
1820 @item output dimensions correspond to the array identifier and dimensions,
1821 @item the first output dimension corresponds to the array identifier,
1822 @item the (i+1)th output dimension corresponds to the ith array dimension (i>1),
1823 @item loop iterators correspond to input dimensions.
1824 @end itemize
1825
1826 As an example of a scattering relation and
1827 assuming that @samp{i}, @samp{j} and @samp{k} are the loop
1828 iterators and @samp{M} and @samp{N} are the parameters, let us consider
1829 the array access @code{A[2*i+j][j][i+N]} (the identifier of @code{A} is 42),
1830 and let us suppose this is a read access. Its representation would be the
1831 following:
1832
1833 @example
1834 @group
1835 # A read access relation
1836 READ
1837 # 4 rows, 11 columns: 4 array dimensions, 3 iterators, 2 parameters
1838 4 11 4 3 0 2
1839 # e/i|Arr [1] [2] [3]| i   j   k | M   N | 1
1840    0  -1   0   0   0   0   0   0   0   0  42   # A
1841    0   0  -1   0   0   2   1   0   0   0   0   #  [2*i+j]
1842    0   0   0  -1   0   0   1   0   0   0   0   #         [j]
1843    0   0   0   0  -1   1   0   0   0   1   0   #            [i+N]
1844 @end group
1845 @end example
1846
1847 To understand this representation, consider that OpenScop accesses
1848 are general memory accesses and not array accesses. The memory is
1849 seen as a big array @code{Mem} while usual array names correspond to
1850 the first dimension. Hence our example translates to @code{Mem[42][2*i+j][j][i+N]}.
1851
1852 Unions of access relations are allowed. In this case, each union part must
1853 refer at the same array identifier, and the number of dimensions must be
1854 consistent.
1855
1856 @c %/*************************************************************************
1857 @c % *                               GENERICS                                *
1858 @c % *************************************************************************/
1859 @node Generics
1860 @subsection Generics
1861 @menu
1862 * Strings Generic::
1863 * Body Generic::
1864 @end menu
1865
1866 @emph{Generics} represent any elaborated non-polyhedral information in the
1867 OpenScop format. They are used to represent the parameter information, the
1868 statement body information as well as the extensions. Each generic information
1869 is delimited using XML-like tags corresponding to its URI (Unique Resource
1870 Identifier), For instance, if the generic has the URI @code{foo}, the begin
1871 tag is @code{<foo>} and the end tag is @code{</foo>}).
1872
1873 Two generics, namely @code{strings} (@pxref{Strings Generic}) and
1874 @code{body} (@pxref{Body Generic}) are part of the OpenScop
1875 specification to provide the minimum, stricly necessary information to
1876 build a complete source-to-source polyhedral framework based on OpenScop.
1877 However, generics can be basically @emph{anything} as long as they are
1878 properly delimited. OpenScop implementations will simply ignore
1879 non-supported generics and warn the user with the mention of the
1880 non-supported URIs. Support of new generics will be added throught the
1881 extension mechanism.
1882
1883 @c ---------------------------------------------------------------------------
1884
1885 @node Strings Generic
1886 @subsubsection Strings Generic
1887
1888 The purpose of the @code{strings} generic is to represent a list of
1889 textual strings on one line (which may be used, e.g., to represent the list of
1890 parameter names in the order used in the relation). Its URI is @code{strings}
1891 and its file format respects the following grammar:
1892 @example
1893 Strings_generic     ::= "<strings>" Strings "</strings>"
1894 Strings             ::= _String String_list  | (void)
1895 @end example
1896
1897 @noindent A possible example of textual @code{strings} is the following:
1898 @example
1899 @group
1900 <strings>
1901 Not one sentence but 6 strings!
1902 </strings>
1903 @end group
1904 @end example
1905
1906 @c ---------------------------------------------------------------------------
1907
1908 @node Body Generic
1909 @subsubsection Body Generic
1910
1911 The purpose of the @code{body} generic is to represent the textual
1912 information about a statement. It contains the number of original iterators on
1913 the first line, the list of original iterators on the second
1914 line (the loop counters of the statement surrounding loops in the original
1915 program) and the original textual body expression on the third line.
1916 Its URI is @code{body} and its file format respects the following grammar
1917 (the @code{String} rule is reused, @pxref{Strings Generic}):
1918 @example
1919 Body_generic        ::= "<body>" Body "</body>"
1920 Body                ::= Nb_iterators Iterator_list Expression
1921 Nb_iterators        ::= _Integer
1922 Iterator_list       ::= Strings
1923 Expression          ::= Strings
1924 @end example
1925
1926 @noindent A possible example of textual @code{body} is the following:
1927 @example
1928 @group
1929 <body>
1930 # Number of original iterators
1931 2
1932 # Original iterators
1933 i j
1934 # Original statement expression
1935 A[i+j] += B[i] * C[j];
1936 </body>
1937 @end group
1938 @end example
1939
1940
1941 @c %/*************************************************************************
1942 @c % *                      DATA STRUCTURE SPECIFICATION                     *
1943 @c % *************************************************************************/
1944 @node OpenScop Data Structure Specification
1945 @section OpenScop Data Structure Specification
1946
1947 @menu
1948 * osl_int_t::
1949 * osl_relation_t::
1950 * osl_relation_list_t::
1951 * osl_interface_t::
1952 * osl_generic_t::
1953 * osl_strings_t::
1954 * osl_body_t::
1955 * osl_statement_t::
1956 * osl_scop_t::
1957 @end menu
1958
1959 The OpenScop specification offers a small set of C data structures devoted to
1960 represent a SCoP in memory in a convenient way. Using them in some tool or
1961 library may greatly facilitate its interaction with other tools or libraries
1962 which rely on this representation as well. Every field may not be useful for
1963 a given tool or library. A general rule for all the data structure is
1964 that a @code{NULL} pointer or a -1 integer value means the information is
1965 not present. Contrary to engineering time, memory is cheap today, so it's much
1966 probably not a big deal that some fields are left empty. Every field may not
1967 be enough for a given tool or library. In this case it is much recommended
1968 to provide a new extension which may be reused by other users
1969 (@pxref{Extensions}).
1970
1971 Each tool or library may have its own implementation of the OpenScop
1972 data structures. The type names should not be the same as those provided
1973 as an example here (they correspond to the OpenScop Library implementation).
1974 The names of the fields, and their ordering, should however be the same. In this
1975 way, the interaction between tools and libraries should be as simple as a cast.
1976
1977 Before reading at the OpenScop data structures, it is much recommended to
1978 read at the OpenScop file format description, as it is quite close to this
1979 representation (@pxref{OpenScop File Format Specification}).
1980
1981 @c ---------------------------------------------------------------------------
1982
1983 @node osl_int_t
1984 @subsection osl_int_t
1985
1986 @example
1987 @group
1988 union osl_int @{
1989   long int  sp;               /* Single precision int */
1990   long long dp;               /* Double precision int */
1991   void*     mp;               /* Pointer to a multiple precision int */
1992 @};
1993 typedef union osl_int  osl_int_t;
1994 typedef union osl_int* osl_int_p;
1995 @end group
1996 @end example
1997
1998 @noindent The @code{osl_int_t} union stores an integer element. The
1999 union is used to implement the multiple precision support of OpenScop.
2000 A given implementation may or may not support a given precision type.
2001 However, dedicated functions or macros must tell the user whether a given
2002 precision type is supported or not. The @code{mp} field is a pointer to
2003 a multiple precision int, e.g., a @code{mpz_t} from the GNU GMP library.
2004
2005 @c ---------------------------------------------------------------------------
2006
2007 @node osl_relation_t
2008 @subsection osl_relation_t
2009
2010 @example
2011 @group
2012 struct osl_relation @{
2013   int type;                   /* What this relation is encoding */
2014   int precision;              /* Precision of the matrix elements */
2015   int nb_rows;                /* Number of rows */
2016   int nb_columns;             /* Number of columns */
2017   int nb_output_dims;         /* Number of output dimensions */
2018   int nb_input_dims;          /* Number of input dimensions */
2019   int nb_local_dims;          /* Number of local dimensions */
2020   int nb_parameters;          /* Number of parameters */
2021   osl_int_t** m;              /* Matrix of constraints */
2022   void* usr;                  /* User-managed field */
2023   struct osl_relation*  next; /* Next relation in the union */
2024 @};
2025 typedef struct osl_relation  osl_relation_t;
2026 typedef struct osl_relation* osl_relation_p;
2027 @end group
2028 @end example
2029
2030 @noindent The @code{osl_relation_t} structure stores a part of an
2031 union of relations. A union of relation is a @code{NULL}-terminated
2032 linked list of union parts (@code{next} field). The @code{type} field
2033 may provide some information about what the relation is encoding:
2034 @itemize @bullet
2035 @item -1: undefined (@code{OSL_UNDEFINED}),
2036 @item 2: context domain (@code{OSL_TYPE_CONTEXT}),
2037 @item 3: iteration domain (@code{OSL_TYPE_DOMAIN}),
2038 @item 4: scattering relation (@code{OSL_TYPE_SCATTERING}),
2039 @item 6: read access relation (@code{OSL_TYPE_READ}),
2040 @item 7: write access relation (@code{OSL_TYPE_WRITE}),
2041 @item 8: may write access relation (@code{OSL_TYPE_MAY_WRITE}),
2042 @end itemize
2043 The various numbers provide the details on the relation itself
2044 (@pxref{Relations}) while the @code{m} field points to
2045 the constraint matrix. The precision of the constraint matrix elements is
2046 provided by the @code{precision} field. It can take the following
2047 values:
2048 @itemize @bullet
2049 @item 32: 32 bits precision, elements are @code{long int}
2050       (@code{OSL_PRECISION_SP}),
2051 @item 64: 64 bits precision, elements are @code{long long int}
2052       (@code{OSL_PRECISION_DP}),
2053 @item 0: multiple precision, elements are GNU GMP Library's
2054       @code{mpz_t} (@code{OSL_PRECISION_MP}).
2055 @end itemize
2056 Finally, the @code{usr} field is provided for user's convenience.
2057
2058 @c ---------------------------------------------------------------------------
2059
2060 @node osl_relation_list_t
2061 @subsection osl_relation_list_t
2062
2063 @example
2064 @group
2065 struct osl_relation_list @{
2066   osl_relation_p elt;             /* Element of the list */
2067   struct osl_relation_list* next; /* Next element of the list */
2068 @};
2069 typedef struct osl_relation_list  osl_relation_list_t;
2070 typedef struct osl_relation_list* osl_relation_list_p;
2071 @end group
2072 @end example
2073
2074 @noindent The @code{osl_relation_list_t} structure is a @code{NULL}-terminated
2075 linked list of @code{osl_relation_t} data structures.
2076 @code{elt} is a relation element of the list and @code{next} is the pointer to
2077 the next element of the list.
2078
2079 @c ---------------------------------------------------------------------------
2080
2081 @node osl_interface_t
2082 @subsection osl_interface_t
2083
2084 @example
2085 @group
2086 typedef void  (*osl_idump_f) (FILE*, void*, int);
2087 typedef char* (*osl_sprint_f)(void*);
2088 typedef void* (*osl_sread_f) (char*);
2089 typedef void* (*osl_malloc_f)();
2090 typedef void  (*osl_free_f)  (void*);
2091 typedef void* (*osl_clone_f) (void*);
2092 typedef int   (*osl_equal_f) (void*, void*);
2093
2094 struct osl_interface @{
2095   char* URI;                  /* Unique interface identifier string */
2096   osl_idump_f  idump;         /* Pointer to the idump function */
2097   osl_sprint_f sprint;        /* Pointer to the sprint function */
2098   osl_sread_f  sread;         /* Pointer to the sread function */
2099   osl_malloc_f malloc;        /* Pointer to the malloc function */
2100   osl_free_f   free;          /* Pointer to the free function */
2101   osl_clone_f  clone;         /* Pointer to the clone function */
2102   osl_equal_f  equal;         /* Pointer to the equal function */
2103   struct osl_interface* next; /* Next interface in the list */
2104 @};
2105 typedef struct osl_interface  osl_interface_t;
2106 typedef struct osl_interface* osl_interface_p;
2107 @end group
2108 @end example
2109
2110 @noindent The @code{osl_interface_t} structure represents a
2111 node in a @code{NULL}-terminated list of interfaces. Each node stores the
2112 @emph{interface} of a generic OpenScop object, i.e., its unique name
2113 (@code{URI}) and the function pointers to all the base functions it has
2114 to provide. Extension providers will find information relative to those
2115 functions in the OpenScop Library description (@pxref{Base Functions})
2116 and the section dedicated to writing extensions
2117 (@pxref{Extension Development}).
2118
2119 @c ---------------------------------------------------------------------------
2120
2121 @node osl_generic_t
2122 @subsection osl_generic_t
2123
2124 @example
2125 @group
2126 struct osl_generic @{
2127   void* data;                 /* Pointer to some data */
2128   osl_interface_p interface;  /* Interface to work with the data */
2129   struct osl_generic* next;   /* Pointer to the next generic */
2130 @};
2131 typedef struct osl_generic  osl_generic_t;
2132 typedef struct osl_generic* osl_generic_p;
2133 @end group
2134 @end example
2135
2136 @noindent The @code{osl_generic_t} structure represents a node in a
2137 @code{NULL}-terminated list of generic elements. It stores some data
2138 and operations with no pre-defined type. The information is accessible
2139 through the @code{data} pointer while the type and operations are
2140 accessible through the @code{interface} pointer. It is used to represent
2141 data that are allowed to differ in implementations, such as symbols and
2142 extensions.
2143
2144 @c ---------------------------------------------------------------------------
2145
2146 @node osl_strings_t
2147 @subsection osl_strings_t
2148
2149 @example
2150 @group
2151 struct osl_string @{
2152   char** string;              /* NULL-terminated array of strings */
2153 @};
2154 typedef struct osl_strings  osl_strings_t;
2155 typedef struct osl_strings* osl_strings_p;
2156 @end group
2157 @end example
2158
2159 @noindent The @code{osl_strings_t} structure represents a NULL-terminated
2160 list of C character strings. It is encapsulated into a structure to allow
2161 its manipulation through a generic type.
2162
2163 @c ---------------------------------------------------------------------------
2164
2165 @node osl_body_t
2166 @subsection osl_body_t
2167
2168 @example
2169 @group
2170 struct osl_body @{
2171   osl_strings_p iterators;    /* Original iterators */
2172   osl_strings_p expression;   /* Original statement expression */
2173 @};
2174 typedef struct osl_body  osl_body_t;
2175 typedef struct osl_body* osl_body_p;
2176 @end group
2177 @end example
2178
2179 @noindent The @code{osl_body_t} structure stores a statement body in a
2180 textual form. The complete original expression (directly copy-pasted
2181 from the original code) is in the expression field while the textual forms
2182 of the original iterators are in the iterators field. They may be used for
2183 substitutions inside the expression.
2184
2185 @c ---------------------------------------------------------------------------
2186
2187 @node osl_statement_t
2188 @subsection osl_statement_t
2189
2190 @example
2191 @group
2192 struct osl_statement @{
2193   osl_relation_p domain;      /* Iteration domain */
2194   osl_relation_p scattering;  /* Scattering relation */
2195   osl_relation_list_p access; /* List of array access relations */
2196   osl_generic_p extension;    /* Generic extensions */
2197   void* usr;                  /* A user-defined field */
2198   struct osl_statement* next; /* Next statement in the list */
2199 @};
2200 typedef struct osl_statement  osl_statement_t;
2201 typedef struct osl_statement* osl_statement_p;
2202 @end group
2203 @end example
2204
2205 @noindent The @code{osl_statement_t} structure represents a node
2206 in a @code{NULL}-terminated linked list of statements. Each node contains the
2207 useful information for a given statement to process it within a polyhedral
2208 framework. The order in the list may matter for naming conventions
2209 (e.g. "S1" for the first statement in the list). The iteration domain
2210 and the scattering are represented using an @code{osl_relation_p}
2211 structure while the accesses are using a list of
2212 relations: one for each memory access in the statement.
2213 The extensions part is a list of generics @code{osl_generic_t} which can
2214 contain useful structures such as @code{body}.
2215 The @code{body} field should provide information about the statement body
2216 (since it has a generic type, the specification is not strict about how it
2217 is used), e.g., using the @code{osl_body_t} data structure (@pxref{osl_body_t}).
2218 It is also possible to use the @code{usr} field, but it has to be
2219 totally managed by the user.
2220
2221 @c ---------------------------------------------------------------------------
2222
2223 @node osl_scop_t
2224 @subsection osl_scop_t
2225 @example
2226 @group
2227 struct osl_scop @{
2228   int version;                /* Version of the data structure */
2229   char* language;             /* Target language */
2230   osl_relation_p context;     /* Constraints on the parameters */
2231   osl_generic_p parameters;   /* Information about parameters */
2232   osl_statement_p statement;  /* Statement list */
2233   osl_interface_p registry;   /* Registered extension interfaces */
2234   osl_generic_p extension;    /* Extension list */
2235   void* usr;                  /* A user-defined field */
2236   struct osl_scop* next;      /* Next scop in the list */
2237 @};
2238 typedef struct osl_scop  osl_scop_t;
2239 typedef struct osl_scop* osl_scop_p;
2240 @end group
2241 @end example
2242
2243 @noindent @code{osl_scop_t} represents a node in a
2244 @code{NULL}-terminated list of scops. It stores the useful informations
2245 of a static control part of a program to process it within a polyhedral
2246 framework. To prepare OpenScop specification evolution, the @code{version}
2247 field tells the version of the data structure. It should be set to 1 for
2248 now (and hopefully a very, very, long time).
2249 First, it contains the informations about the context. The target language
2250 in expressed in the @code{language} field. The constraints on the
2251 global parameters are detailed in the @code{context} field.
2252 The @code{paremeters} field should provide information about the
2253 parameters (since it has a generic type, the specification is not strict
2254 about how it is used), e.g., using the @code{osl_strings_t} data structure
2255 (@pxref{osl_strings_t}).
2256 Finally, it contains the list of statements @code{statement}, the list
2257 of registered interfaces for generic types @code{registry} and the list of
2258 extentions @code{extension}.
2259 It is also possible to use the @code{usr} field, but it has to be
2260 totally managed by the user.
2261
2262 As an example, let us consider again the matrix multiply program
2263 (@pxref{Preliminary Example}).
2264 The next figure gives a possible representation in memory for this
2265 SCoP thanks to the OpenScop data structures (it has been actually printed
2266 by the @code{osl_scop_dump} function), note that symbols like
2267 parameters, original iterators and statement expression are represented
2268 with an @code{osl_strings_t} which does not belong to the
2269 specification but to the OpenScop Library implementation:
2270
2271 @c @smallexample
2272 @example
2273 +-- osl_scop_t
2274 |    |
2275 |    Version: 1
2276 |    |
2277 |    Language: C
2278 |    |
2279 |    +-- osl_relation_t (CONTEXT, 32 bits)
2280 |    |    1 3 0 0 0 1
2281 |    |    [   1   1  -1 ]
2282 |    |
2283 |    +-- osl_generic_t
2284 |    |    |
2285 |    |    +-- osl_interface_t: URI = strings
2286 |    |    |
2287 |    |    +-- osl_strings_t: N
2288 |    |    |
2289 |    |
2290 |    +-- osl_statement_t (S1)
2291 |    |    |
2292 |    |    +-- osl_relation_t (DOMAIN, 32 bits)
2293 |    |    |    4 5 2 0 0 1
2294 |    |    |    [   1   1   0   0   0 ]
2295 |    |    |    [   1  -1   0   1  -1 ]
2296 |    |    |    [   1   0   1   0   0 ]
2297 |    |    |    [   1   0  -1   1  -1 ]
2298 |    |    |
2299 |    |    +-- osl_relation_t (SCATTERING, 32 bits)
2300 |    |    |    5 10 5 2 0 1
2301 |    |    |    [   0  -1   0   0   0   0   0   0   0   0 ]
2302 |    |    |    [   0   0  -1   0   0   0   1   0   0   0 ]
2303 |    |    |    [   0   0   0  -1   0   0   0   0   0   0 ]
2304 |    |    |    [   0   0   0   0  -1   0   0   1   0   0 ]
2305 |    |    |    [   0   0   0   0   0  -1   0   0   0   0 ]
2306 |    |    |
2307 |    |    +-- osl_relation_list_t
2308 |    |    |    |
2309 |    |    |    +-- osl_relation_t (WRITE, 32 bits)
2310 |    |    |    |    3 8 3 2 0 1
2311 |    |    |    |    [   0  -1   0   0   0   0   0   1 ]
2312 |    |    |    |    [   0   0  -1   0   1   0   0   0 ]
2313 |    |    |    |    [   0   0   0  -1   0   1   0   0 ]
2314 |    |    |    |
2315 |    |    |
2316 |    |    +-- osl_generic_t
2317 |    |    |    |
2318 |    |    |    +-- osl_interface_t: URI = body
2319 |    |    |    |
2320 |    |    |    +-- osl_strings_t: i j
2321 |    |    |    |
2322 |    |    |    +-- osl_strings_t: C[i][j] = 0.0;
2323 |    |    |    |
2324 |    |    |
2325 |    |    V
2326 |    |  osl_statement_t (S2)
2327 |    |    |
2328 |    |    +-- osl_relation_t (DOMAIN, 32 bits)
2329 |    |    |    6 6 3 0 0 1
2330 |    |    |    [   1   1   0   0   0   0 ]
2331 |    |    |    [   1  -1   0   0   1  -1 ]
2332 |    |    |    [   1   0   1   0   0   0 ]
2333 |    |    |    [   1   0  -1   0   1  -1 ]
2334 |    |    |    [   1   0   0   1   0   0 ]
2335 |    |    |    [   1   0   0  -1   1  -1 ]
2336 |    |    |
2337 |    |    +-- osl_relation_t (SCATTERING, 32 bits)
2338 |    |    |    7 13 7 3 0 1
2339 |    |    |    [   0  -1   0   0   0   0   0   0   0   0   0   0   0 ]
2340 |    |    |    [   0   0  -1   0   0   0   0   0   1   0   0   0   0 ]
2341 |    |    |    [   0   0   0  -1   0   0   0   0   0   0   0   0   0 ]
2342 |    |    |    [   0   0   0   0  -1   0   0   0   0   1   0   0   0 ]
2343 |    |    |    [   0   0   0   0   0  -1   0   0   0   0   0   0   1 ]
2344 |    |    |    [   0   0   0   0   0   0  -1   0   0   0   1   0   0 ]
2345 |    |    |    [   0   0   0   0   0   0   0  -1   0   0   0   0   0 ]
2346 |    |    |
2347 |    |    +-- osl_relation_list_t
2348 |    |    |    |
2349 |    |    |    +-- osl_relation_t (WRITE, 32 bits)
2350 |    |    |    |    3 9 3 3 0 1
2351 |    |    |    |    [   0  -1   0   0   0   0   0   0   1 ]
2352 |    |    |    |    [   0   0  -1   0   1   0   0   0   0 ]
2353 |    |    |    |    [   0   0   0  -1   0   1   0   0   0 ]
2354 |    |    |    |
2355 |    |    |    V
2356 |    |    |  osl_relation_list_t
2357 |    |    |    |
2358 |    |    |    +-- osl_relation_t (READ, 32 bits)
2359 |    |    |    |    3 9 3 3 0 1
2360 |    |    |    |    [   0  -1   0   0   0   0   0   0   1 ]
2361 |    |    |    |    [   0   0  -1   0   1   0   0   0   0 ]
2362 |    |    |    |    [   0   0   0  -1   0   1   0   0   0 ]
2363 |    |    |    |
2364 |    |    |    V
2365 |    |    |  osl_relation_list_t
2366 |    |    |    |
2367 |    |    |    +-- osl_relation_t (READ, 32 bits)
2368 |    |    |    |    3 9 3 3 0 1
2369 |    |    |    |    [   0  -1   0   0   0   0   0   0   2 ]
2370 |    |    |    |    [   0   0  -1   0   1   0   0   0   0 ]
2371 |    |    |    |    [   0   0   0  -1   0   0   1   0   0 ]
2372 |    |    |    |
2373 |    |    |    V
2374 |    |    |  osl_relation_list_t
2375 |    |    |    |
2376 |    |    |    +-- osl_relation_t (READ, 32 bits)
2377 |    |    |    |    3 9 3 3 0 1
2378 |    |    |    |    [   0  -1   0   0   0   0   0   0   3 ]
2379 |    |    |    |    [   0   0  -1   0   0   0   1   0   0 ]
2380 |    |    |    |    [   0   0   0  -1   0   1   0   0   0 ]
2381 |    |    |    |
2382 |    |    |
2383 |    |    +-- osl_generic_t
2384 |    |    |    |
2385 |    |    |    +-- osl_interface_t: URI = body
2386 |    |    |    |
2387 |    |    |    +-- osl_strings_t: i j k
2388 |    |    |    |
2389 |    |    |    +-- osl_strings_t: C[i][j] = C[i][j] + A[i][k]*B[k][j];
2390 |    |    |    |
2391 |    |    |
2392 |    |
2393 |    +-- NULL interface
2394 |    |
2395 |    +-- NULL generic
2396 |    |
2397 |
2398 @end example
2399 @c @end smallexample
2400
2401
2402 @c %/*************************************************************************
2403 @c % *                              EXTENSIONS                               *
2404 @c % *************************************************************************/
2405 @node Extensions
2406 @section Extensions
2407
2408 The core part of the OpenScop representation embeds what is strictly
2409 necessary to build a complete source-to-source polyhedral framework.
2410 However it may not be enough. Hence, OpenScop offers a very flexible
2411 extension part. Actually, the only constraint to build an extension is
2412 to request the OpenScop maintainer for a unique extension name: its URI
2413 (ask the maintainer through the OpenScop mailing list
2414 @email{openscop-development@@googlegroups.com}).
2415
2416 The policy to support extensions is the following and is pretty simple: an
2417 OpenScop implementation is not required to support any extension. If it
2418 is processing an OpenScop file or data structure which contains an
2419 extension which is not supported, it must (1) warn the user with the
2420 mention of the URI of the non-supported extension
2421 and (2) ignore this extension.
2422
2423 Extensions in an OpenScop file are provided after the core part, without
2424 any specific order. Each extension is delimited using
2425 XML-like tags corresponding to its URI (e.g., if the extension has the URI
2426 @code{foo}, the begin tag is @code{<foo>} and the end tag is @code{</foo>}).
2427 There is no specification or preferred way to write the extension body.
2428 Extensions in an OpenScop data structure must be accessible through one
2429 pointer. This pointer will be stored in the @code{data} field of an
2430 @code{osl_generic_t} container (@pxref{osl_generic_t}). There must be only
2431 one extension with the same URI in an OpenScop file or data structure.
2432
2433 Extension writers may write a short documentation about their extension to
2434 be added to this document. For consistency reason, this
2435 documentation should comply to the documentation of the
2436 @code{comment} option (@pxref{Comment Extension}). To describe the
2437 file format, it is allowed to reuse the existing rules and terminals
2438 present in the OpenScop file format description without defining them
2439 (@pxref{OpenScop File Format Specification}). By sending a
2440 documentation, you accept it to be added to this document. In
2441 particular, the sender fully accepts the license and copyright notice.
2442
2443 @menu
2444 * Comment Extension::
2445 * Arrays Extension::
2446 * Scatnames Extension::
2447 * Coordinates Extension::
2448 * Clay Extension::
2449 * Extbody Extension::
2450 * Loop Extension::
2451 * Pluto unroll Extension::
2452 * Irregular Extension::
2453 @end menu
2454
2455 @c ---------------------------------------------------------------------------
2456
2457 @node Comment Extension
2458 @subsection Comment Extension
2459
2460 @noindent @strong{Description}
2461 @itemize @bullet
2462 @item URI: @code{comment}.
2463 @item Author: C@'edric Bastoul <cedric.bastoul@@u-psud.fr>.
2464 @item Purpose: the @code{comment} extension stores a textual string.
2465 @end itemize
2466
2467 @noindent @strong{File Format}
2468
2469 @noindent The @code{comment} extension file format respects the following
2470 grammar:
2471 @example
2472 Comment_generic     ::= "<comment>" Comment "</comment>"
2473 Comment             ::= _Text
2474 @end example
2475
2476 @noindent An example of textual @code{comment} extension is the following:
2477 @example
2478 @group
2479 <comment>
2480 This is a comment string.
2481 </comment>
2482 @end group
2483 @end example
2484
2485 @noindent @strong{Data Structure}
2486
2487 @noindent The @code{comment} extension data structure is the following:
2488 @example
2489 @group
2490 struct osl_comment @{
2491   char* comment;  /* Comment message as a 0-terminated string */
2492 @};
2493 typedef struct osl_comment  osl_comment_t;
2494 typedef struct osl_comment* osl_comment_p;
2495 @end group
2496 @end example
2497
2498 @c ---------------------------------------------------------------------------
2499
2500
2501 @node Scatnames Extension
2502 @subsection Scatnames Extension
2503
2504 @noindent @strong{Description}
2505 @itemize @bullet
2506 @item URI: @code{scatnames}.
2507 @item Author: C@'edric Bastoul <cedric.bastoul@@u-psud.fr>.
2508 @item Purpose: the @code{scatnames} extension provides a list of textual
2509 scattering dimension names.
2510 @end itemize
2511
2512 @noindent @strong{File Format}
2513
2514 @noindent The @code{scatnames} extension file format respects the following
2515 grammar. It reuses the @code{Strings} description (@pxref{Strings Generic}):
2516 @example
2517 Scatnames_generic   ::= "<scatnames>" Scatnames "</scatnames>"
2518 Scatnames           ::= Strings
2519 @end example
2520
2521 @noindent The list of scattering dimension names is provided on one single
2522 line. The names are separated with spaces. A possible
2523 example of such an extension is the following:
2524
2525 @example
2526 @group
2527 <scatnames>
2528 # List of scattering dimension names:
2529 beta_0 i beta_1 j beta_2
2530 </scatnames>
2531 @end group
2532 @end example
2533
2534 @noindent @strong{Data Structure}
2535
2536 @noindent The @code{scatnames} extension data structure is the following:
2537
2538 @example
2539 @group
2540 struct osl_scatnames @{
2541   osl_strings_p names;  /* List of textual scattering dimension names. */
2542 @};
2543 typedef struct osl_scatnames  osl_scatnames_t;
2544 typedef struct osl_scatnames* osl_scatnames_p;
2545 @end group
2546 @end example
2547
2548 @noindent The order of the scattering dimension names in the list corresponds
2549 to the order of the scattering dimensions.
2550
2551 @c ---------------------------------------------------------------------------
2552
2553
2554 @node Arrays Extension
2555 @subsection Arrays Extension
2556
2557 @noindent @strong{Description}
2558 @itemize @bullet
2559 @item URI: @code{arrays}.
2560 @item Author: C@'edric Bastoul <cedric.bastoul@@u-psud.fr>.
2561 @item Purpose: the @code{arrays} extension provides a set of textual array
2562 names corresponding to the array identifiers used in the access relations.
2563 @end itemize
2564
2565 @noindent @strong{File Format}
2566
2567 @noindent The @code{arrays} extension file format respects the following
2568 grammar:
2569 @example
2570 Arrays_generic      ::= "<arrays>" Arrays "</arrays>"
2571 Arrays              ::= Nb_items Item_list
2572 Item_List           ::= Item Item_list | (void)
2573 Item                ::= Identifier Name
2574 Nb_items            ::= _Integer
2575 Identifier          ::= _Integer
2576 Name                ::= _String
2577 @end example
2578
2579 @noindent The number of array names is provided on the first line,
2580 then each following line contains a couple identifier-name.
2581 For instance, the following example is a correct textual @code{arrays}
2582 extension. It corresponds to the array names of the preliminary example
2583 (@pxref{Preliminary Example}):
2584
2585 @example
2586 @group
2587 <arrays>
2588 # Number of array names:
2589 3
2590 1 C # Identifier 1 corresponds to array name "C"
2591 3 B # Identifier 3 corresponds to array name "B"
2592 2 A # Identifier 2 corresponds to array name "A"
2593 </arrays>
2594 @end group
2595 @end example
2596
2597 @noindent @strong{Data Structure}
2598
2599 @noindent The @code{arrays} extension data structure is the following:
2600
2601 @example
2602 @group
2603 struct osl_arrays @{
2604   int nb_names;     /* Number of names */
2605   int *  id;        /* Array of nb_names identifiers */
2606   char** names;     /* Array of nb_names names */
2607 @};
2608 typedef struct osl_arrays  osl_arrays_t;
2609 typedef struct osl_arrays* osl_arrays_p;
2610 @end group
2611 @end example
2612
2613 @noindent Each name has a name string and an identifier: the ith name has name
2614 string @code{names[i]} and identifier @code{id[i]}.
2615
2616
2617 @c ---------------------------------------------------------------------------
2618
2619 @node Coordinates Extension
2620 @subsection Coordinates Extension
2621
2622 @noindent @strong{Description}
2623 @itemize @bullet
2624 @item URI: @code{coordinates}.
2625 @item Author: C@'edric Bastoul <cedric.bastoul@@u-psud.fr>.
2626 @item Purpose: the @code{coordinates} extension provides the information
2627 about the SCoP location in the original code: the original file name/path,
2628 the starting and ending lines of the SCoP in this file (inclusives) and
2629 the indentation level.
2630 @end itemize
2631
2632 @noindent @strong{File Format}
2633
2634 @noindent The @code{coordinates} extension file format respects the following
2635 grammar:
2636 @example
2637 Coordinates_generic ::= "<coordinates>" Coordinates "</coordinates>"
2638 Coordinates         ::= File_name Start_line Start_column
2639                         End_line End_column Indentation
2640 File_name           ::= _String
2641 Start_line          ::= _Integer
2642 Start_column        ::= _Integer
2643 End_line            ::= _Integer
2644 End_column          ::= _Integer
2645 Indentation         ::= _Integer
2646 @end example
2647
2648 @noindent The original file name where the SCoP has been extracted is
2649 provided on the first line, then the starting line and column numbers of the SCoP,
2650 then the ending line and column numbers of the SCoP, and lastly the indentation level
2651 (the number of spaces characters each line of the SCoP starts with).
2652 For instance, the following example is a correct textual
2653 @code{coordinates} extension:
2654
2655 @example
2656 @group
2657 <coordinates>
2658 # File name
2659 ./test/ax-do.c
2660 # Starting line and column
2661 9 3
2662 # Ending line and column
2663 15 4
2664 # Indentation
2665 2
2666 </coordinates>
2667 @end group
2668 @end example
2669
2670 @noindent @strong{Data Structure}
2671
2672 @noindent The @code{coordinates} extension data structure is the following:
2673
2674 @example
2675 @group
2676 struct osl_coordinates @{
2677   char* name;   /* File name */
2678   int   start;  /* First line of the SCoP in the source file */
2679   int   end;    /* Last line of the SCoP in the source file */
2680   int   indent; /* Indentation */
2681 @};
2682 typedef struct osl_coordinates  osl_coordinates_t;
2683 typedef struct osl_coordinates* osl_coordinates_p;
2684 @end group
2685 @end example
2686
2687
2688 @c ---------------------------------------------------------------------------
2689
2690
2691 @node Clay Extension
2692 @subsection Clay Extension
2693
2694 @noindent @strong{Description}
2695 @itemize @bullet
2696 @item URI: @code{clay}.
2697 @item Author: C@'edric Bastoul <cedric.bastoul@@u-psud.fr>.
2698 @item Purpose: the @code{clay} extension stores a Clay script.
2699 @end itemize
2700
2701 @noindent @strong{File Format}
2702
2703 @noindent The @code{clay} extension file format respects the following
2704 grammar:
2705 @example
2706 Clay_generic        ::= "<clay>" Clay "</clay>"
2707 Clay                ::= _Text
2708 @end example
2709
2710 @noindent An example of a @code{clay} extension is the following:
2711 @example
2712 @group
2713 <clay>
2714 fission([2,1], 1);
2715 stripmine([2], 32, 1);
2716 unroll([3], 4);
2717 </clay>
2718 @end group
2719 @end example
2720
2721 @noindent @strong{Data Structure}
2722
2723 @noindent The @code{clay} extension data structure is the following:
2724 @example
2725 @group
2726 struct osl_clay @{
2727   char* script;  /* Clay script as a 0-terminated string */
2728 @};
2729 typedef struct osl_clay  osl_clay_t;
2730 typedef struct osl_clay* osl_clay_p;
2731 @end group
2732 @end example
2733
2734 @c ---------------------------------------------------------------------------
2735
2736
2737 @node Extbody Extension
2738 @subsection Extbody Extension
2739
2740 @noindent @strong{Description}
2741 @itemize @bullet
2742 @item URI: @code{extbody}.
2743 @item Author: Joel Poudroux <joel.poudroux@@u-psud.fr>.
2744 @item Purpose: the @code{extbody} extension provides a list of
2745 coordinates to locate each access easily in the body string.
2746 @end itemize
2747
2748 @noindent @strong{File Format}
2749
2750 @noindent The @code{extbody} extension file format respects the following
2751 grammar.  It reuses the @code{Body} description (@pxref{Body Generic})
2752
2753 @example
2754 Extbody_generic   ::= "<extbody>" Extbody "</extbody>"
2755 Extbody           ::= Coordinate_number Coordinate_list Body
2756 Coordinate_list   ::= Access_start Access_length Coordinate_list | (void)
2757 Coordinate_number ::= _Integer
2758 Access_start      ::= _Integer
2759 Access_length     ::= _Integer
2760 @end example
2761
2762 @noindent This extension extends the @pxref{Body Generic}.  The number of
2763 accesses @code{Coordinate_number} must be equal to the number of access relations
2764 in the statement. Each coordinate is associated with the corresponding access
2765 relation in the access relation list (the order matters). It is possible that an access has
2766 two relations in the
2767 access relation list (when it is in read/write mode). In this case we can replace one
2768 of the couple of coordinates with a (-1 -1). It is possible to put twice the
2769 same coordinates, but using (-1 -1) may improve some tool efficiency (e.g., Clay would
2770 apply twice the same processing otherwise). For each access,
2771 @code{Access_start} is the index of this acces in the body string (starting from 0),
2772 and @code{Access_length} is the length of the access text. For instance:
2773
2774 @example
2775 @group
2776 <extbody>
2777 # Number of access
2778 3
2779 # Mapping start/length
2780  0  1   # a coordinates (read)
2781 -1 -1   # a coordinates (write)
2782  6  9   # b coordinates
2783 # Number of original iterators
2784 0
2785 # Statement body expression
2786 a++ + b[i+1][j];
2787 </extbody>
2788 @end group
2789 @end example
2790
2791 @noindent @strong{Data Structure}
2792
2793 @noindent The @code{extbody} extension data structure is the following:
2794
2795 @example
2796 @group
2797 struct osl_extbody @{
2798   osl_body_p body;
2799   int nb_access;   /**< Nb of access. */
2800   int * start;     /**< Array of nb_access start. */
2801   int * length;    /**< Array of nb_access length. */
2802 @};
2803 typedef struct osl_extbody  osl_extbody_t;
2804 typedef struct osl_extbody* osl_extbody_p;
2805 @end group
2806 @end example
2807
2808
2809 @c ---------------------------------------------------------------------------
2810
2811
2812 @node Loop Extension
2813 @subsection Loop Extension
2814
2815 @noindent @strong{Description}
2816 @itemize @bullet
2817 @item URI: @code{loop}.
2818 @item Author: Taj Muhammad Khan <taj.khan@@inria.fr>.
2819 @item Purpose: the @code{loop} extension provides a means to transfer
2820 information about loops in the original code. It starts with the number of loops
2821 in the SCoP.
2822 For each loop it records: the iterator name, the number of statements, the statement
2823 identifiers, the names of private variables,
2824 and an identifier for OpenMP pragma directive.
2825 @end itemize
2826
2827 @noindent @strong{File Format}
2828
2829 @noindent The @code{coordinates} extension file format respects the following
2830 grammar:
2831 @example
2832 Loop_generic            ::= "<loop>" Num_loops Loop_list "</loop>"
2833 Num_loops               ::= _Integer
2834 Loop_list               ::= Loop Loop_list | (void)
2835 Loop                    ::= Iterator_name Nb_stmts Stmt_list
2836                                         Private_variables Directive
2837 Iterator_name           ::= _String
2838 Nb_stmts                ::= _Integer
2839 Stmt_list               ::= Stmt_id Stmt_list | (void)
2840 Stmt_id                 ::= _Integer
2841 Private_variables       ::= _String
2842 Directive               ::= _Integer
2843 @end example
2844
2845
2846 @noindent With in the tags, the number of loops is provided on the first line.
2847 Next comes a series of @code{loop} structures. Within each @code{loop},
2848 the first line contains the name of the iterator, next comes the number of
2849 statements in the loop followed by their identifiers separated by spaces.
2850 Next line contains the names of the private variables and in the end is the OpenMP
2851 pragma directive. In case there are no private variables to be declared for a
2852 loop, the special string "(null)" should replace their declaration as shown
2853 below.
2854
2855 For instance, the following example is a correct textual
2856 @code{loop} extension containing two loops:
2857
2858 @example
2859 @group
2860 <loop>
2861 # Number of loops
2862 2
2863 # ===========================================
2864 # Loop number 1
2865 # Iterator name
2866 t2
2867 # Number of stmts
2868 1
2869 # Statement identifiers
2870 1
2871 # Private variables
2872 lbv,ubv
2873 # Directive
2874 1
2875 # ===========================================
2876 # Loop number 2
2877 # Iterator name
2878 t2
2879 # Number of stmts
2880 1
2881 # Statement identifiers
2882 2
2883 # Private variables
2884 (null)
2885 # Directive
2886 1
2887 </loop>
2888 @end group
2889 @end example
2890
2891 @noindent The pragma directive can have one of the defined values:
2892 @code{OSL_LOOP_DIRECTIVE_NONE}, @code{OSL_LOOP_DIRECTIVE_PARALLEL},
2893 @code{OSL_LOOP_DIRECTIVE_MPI}, or @code{OSL_LOOP_DIRECTIVE_VECTOR}.
2894
2895 @noindent @strong{Data Structure}
2896
2897 @noindent The @code{loop} extension data structure is the following:
2898
2899 @example
2900 @group
2901 struct osl_loop @{
2902   char * iter;             /* \0 terminated iterator name */
2903   int  nb_stmts;           /* Number of statements in the loop */
2904   int  *  stmt_ids;        /* Array of statement identifiers. */
2905   char * private_vars;     /* \0 terminated variable names */
2906   int  directive;          /* the OpenMP directive to implement */
2907   struct osl_loop * next;  /* pointer to the next element */
2908 @};
2909 typedef struct osl_loop   osl_loop_t;
2910 typedef struct osl_loop * osl_loop_p;
2911 @end group
2912 @end example
2913
2914
2915 @c ---------------------------------------------------------------------------
2916
2917
2918 @node Pluto unroll Extension
2919 @subsection Pluto unroll Extension
2920
2921 @noindent @strong{Description}
2922 @itemize @bullet
2923 @item URI: @code{pluto_unroll}.
2924 @item Author: L@'ena@"{@dotless{i}}c Bagn@`eres <lenaic.bagneres@@inria.fr>.
2925 @item Purpose: the @code{pluto_unroll} extension provides a means to transfer
2926 unroll information from Pluto in the SCoP.
2927 Pluto saves the iterator name, if unroll-and-jam or not and the unroll factor
2928 for several loops.
2929 @end itemize
2930
2931 @noindent @strong{File Format}
2932
2933 @noindent The @code{pluto_unroll} extension file format respects the following
2934 grammar:
2935 @example
2936 Pluto_unroll  ::= "<pluto_unroll>" Is_present Data_list "</pluto_unroll>"
2937 Is_present    ::= _Integer
2938 Iterator_name ::= _String
2939 Is_Jam        ::= _Integer
2940 Factor        ::= _Integer
2941 Data          ::= Iterator_name Is_Jam Factor Is_present
2942 Data_list     ::= Data_list Data | (void)
2943 @end example
2944
2945
2946 @noindent With in the tags, the first line indicates if data are provided.
2947 Next comes three lines: iteraror name, integer to know if jam or not and
2948 the unroll factor.
2949 Next line indicates if new data are provided, etc.
2950
2951 For instance, the following example is a correct
2952 @code{pluto_unroll} extension containing two unrolls:
2953
2954 @example
2955 @group
2956 <pluto_unroll>
2957 1
2958 # Iterator name
2959 t3
2960 # Jam
2961 1
2962 # Factor
2963 4
2964 # Next
2965 1
2966 # Iterator name
2967 t4
2968 # Jam
2969 0
2970 # Factor
2971 4
2972 # Next
2973 0
2974 </pluto_unroll>
2975 @end group
2976 @end example
2977
2978 @noindent @strong{Data Structure}
2979
2980 @noindent The @code{pluto_unroll} extension data structure is the following:
2981
2982 @example
2983 @group
2984 struct osl_pluto_unroll @{
2985   char*        iter;              /* \0 terminated iterator name */
2986   bool         jam;               /* true if jam, false otherwise */
2987   unsigned int factor;            /* unroll factor */
2988   struct osl_pluto_unroll * next; /* next @{ iter, jam, factor @} */
2989 @};
2990 typedef struct osl_pluto_unroll   osl_pluto_unroll_t;
2991 typedef struct osl_pluto_unroll * osl_pluto_unroll_p;
2992 @end group
2993 @end example
2994
2995
2996 @c ---------------------------------------------------------------------------
2997
2998
2999 @node Irregular Extension
3000 @subsection Irregular Extension
3001
3002
3003 @c ---------------------------------------------------------------------------
3004
3005 @node History
3006 @section History
3007
3008 OpenScop is a follow-up of Louis-No@"el Pouchet et al.'s ScopLib effort which
3009 was itself based on C@'edric Bastoul et al.'s Clan tool. People involved in
3010 OpenScop's genesis are:
3011 @itemize @bullet
3012 @item C@'edric Bastoul
3013 @item Uday Bondhugula
3014 @item Tobias Grosser
3015 @item Louis-No@"el Pouchet
3016 @item Sven Verdoolaege
3017 @end itemize
3018
3019 @c %/*************************************************************************
3020 @c % *                          OpenScop LIBRARY                             *
3021 @c % *************************************************************************/
3022
3023 @node OpenScop Library
3024 @chapter OpenScop Library
3025
3026 The OpenScop Library, or OSL for short, is an example implementation of the
3027 OpenScop specification. Its API is not part of the OpenScop specification.
3028 It offers basic functionalities to manipulate the OpenScop data structures
3029 (allocate, free, copy, dump, etc.) and file format (read, print, etc.).
3030 The OpenScop Library is @emph{not} a polyhedral library. OpenScop is an
3031 exchange format, and the OpenScop Library reflects this.
3032
3033 It is a Free Software using the 3-clause BSD License.
3034 Programmers should feel free to use
3035 it or copy/paste its code in any project, Open Source or not@footnote{Closed
3036 source projects should consider to provide some OpenScop file input
3037 and output, so they can be incorporated to any OpenScop-based tool chain.}.
3038
3039 @menu
3040 * Precision::
3041 * Base Functions::
3042 * Example of OpenScop Library Utilization::
3043 * Installation::
3044 * Documentation::
3045 * Development::
3046 @end menu
3047
3048 @node Precision
3049 @section Precision
3050
3051 The OpenScop specification does not impose a specific type for the
3052 constraint matrix elements. For a maximum flexibility, the OpenScop Library
3053 offers an hybrid precision implementation. It supports 32 bits, 64 bits and
3054 multiple precision (relying on GNU GMP) relations transparently. At relation
3055 allocation time, users have two ways to set the precision. The first way is
3056 to call an allocation function with a precision parameter. The second way is
3057 to rely on the environment variable @code{OSL_PRECISION}.
3058 The accepted values for this variable are @code{32} for 32 bits precision,
3059 @code{64} for 64 bits precision and @code{0} for multiple precision. When this
3060 variable is set, its value becomes the default precision for relation elements.
3061 For instance, to ensure the OpenScop Library will use 64 bits precision
3062 by default, the user may set:
3063 @example
3064 export OSL_PRECISION=64
3065 @end example
3066 @noindent if his shell is, e.g., bash or
3067 @example
3068 setenv OSL_PRECISION 64
3069 @end example
3070 @noindent if his shell is, e.g., tcsh. The user should ad this line to
3071 his .bashrc or .tcshrc (or whatever convenient file) to make this
3072 setting permanent.
3073
3074 The OpenScop Library provides the following function to know whether or not
3075 a given precision type is supported by the library or not:
3076 @example
3077 @group
3078 int osl_int_is_precision_supported(int precision);
3079 @end group
3080 @end example
3081 @noindent this function returns @code{1} if the precision type is
3082 supported, @code{0} otherwise. Possible values for the @code{precision}
3083 parameter are @code{32} for 32 bits (single) precision, @code{64} for
3084 64 bits (double) precision and @code{0} for multiple precision.
3085
3086 @node Base Functions
3087 @section Base Functions
3088
3089 The OpenScop Library provides, for each OpenScop data structure,
3090 a set of functions devoted to basic manipulation, conversion
3091 from file format to data structures and from data structures to
3092 file format. The naming convention is consistent for all data
3093 structures. Hence, the function prototypes differ only with the
3094 name of the data structure. In the following, we will use the
3095 generic term of @emph{structure} to refer at any OpenScop
3096 data structure. For instance the
3097 @code{osl_}@emph{structure}@code{_malloc()} function is a
3098 generic name can be instantiated to
3099 @code{osl_relation_malloc()} or
3100 @code{osl_statement_malloc()} etc.
3101
3102 We present in this documentation only
3103 the main functions. Many other utility functions are provided
3104 to ease OpenScop format manipulation. The reader is invited to
3105 refer at the technical documentation to learn everything about the
3106 OpenScop Library.
3107
3108 @menu
3109 * Dumping::
3110 * Printing::
3111 * Reading::
3112 * Allocating::
3113 * Deallocating::
3114 * Cloning::
3115 * Testing::
3116 @end menu
3117
3118
3119 @node Dumping
3120 @subsection Dumping: osl_@emph{structure}_dump and idump
3121
3122 @example
3123 @group
3124 void osl_@emph{structure}_dump(FILE* output, osl_@emph{structure}_p s);
3125 void osl_@emph{structure}_idump(FILE* output, osl_@emph{structure}_p s, int i);
3126 @end group
3127 @end example
3128
3129 @noindent Each OpenScop data structure has a dumping functions
3130 as shown above. Dumping means writing down the content of the data
3131 structure pointed by @code{s} (and its fields recursively)
3132 in a textual form to the
3133 @code{output} file (the file, possibly @code{stdout}, has to be open
3134 for writing). The textual form is not the OpenScop file format but
3135 another representation closer to the internal representation in
3136 memory and mainly intended for debugging purpose. The @code{idump}
3137 function has an additional integer parameter which corresponds to
3138 an indentation level.
3139
3140 @node Printing
3141 @subsection Printing: osl_@emph{structure}_print
3142
3143 @example
3144 @group
3145 void osl_@emph{structure}_print(FILE* output, osl_@emph{structure}_p s);
3146 @end group
3147 @end example
3148
3149 @noindent Each OpenScop data structure has a pretty printing function
3150 as shown above. It prints the content of the data
3151 structure pointed by @code{s} (and its fields recursively)
3152 according to the OpenScop file format
3153 (@pxref{OpenScop File Format Specification}) to the
3154 @code{output} file (the file, possibly @code{stdout}, has to be open
3155 for writing).
3156
3157 @node Reading
3158 @subsection Reading: osl_@emph{structure}_read
3159
3160 @example
3161 @group
3162 osl_@emph{structure}_p osl_@emph{structure}_read(FILE* input);
3163 @end group
3164 @end example
3165
3166 @noindent Each OpenScop data structure has a reading function
3167 as shown above. It reads the content of an OpenScop
3168 data structure written according to the OpenScop file format
3169 (@pxref{OpenScop File Format Specification}) from
3170 the @code{input} file (the file, possibly @code{stdin}, has to be open
3171 for reading). It returns a pointer to a freshly allocated
3172 @code{osl_@emph{structure}_t} structure containing the
3173 information.
3174
3175 @node Allocating
3176 @subsection Allocating: osl_@emph{structure}_malloc
3177
3178 @example
3179 @group
3180 osl_@emph{structure}_p osl_@emph{structure}_malloc();
3181 @end group
3182 @end example
3183
3184 @noindent Each OpenScop data structure has a memory allocation function
3185 as shown above (except one see below). It allocates the memory to store
3186 the corresponding data structure, it initializes the pointer fields to
3187 @code{NULL} and the integer fields to @code{OSL_UNDEFINED}
3188 (@code{-1}) and it returns a pointer to the allocated space.
3189
3190 An exception to this base description is the
3191 @code{osl_relation_malloc()} function which requires two
3192 parameters: the number of rows and columns of the constraint
3193 matrix (@pxref{Relations}):
3194
3195 @example
3196 @group
3197 osl_relation_p osl_relation_malloc(int nb_rows, int nb_columns);
3198 @end group
3199 @end example
3200
3201 @noindent The precision of the relation elements will depend on the
3202 @code{OSL_PRECISION} environment variable (@pxref{Precision}) if it is set,
3203 or the maximum available precision if it is not set. Another allocation
3204 function is provided to explicitly set a given precision:
3205
3206 @example
3207 @group
3208 osl_relation_p osl_relation_pmalloc(int precision,
3209                                     int nb_rows, int nb_columns);
3210 @end group
3211 @end example
3212
3213 @noindent The @code{precision} field may take the following values:
3214 @itemize @bullet
3215 @item @code{OSL_PRECISION_SP} for 32 bits precision,
3216 @item @code{OSL_PRECISION_DP} for 64 bits precision,
3217 @item @code{OSL_PRECISION_MP} for multiple precision,
3218 @end itemize
3219
3220 @node Deallocating
3221 @subsection Deallocating: osl_@emph{structure}_free
3222
3223 @example
3224 @group
3225 void osl_@emph{structure}_free(osl_@emph{structure}_p s);
3226 @end group
3227 @end example
3228
3229 @noindent Each OpenScop data structure has a memory deallocation function
3230 as shown above. It recursively frees the memory allocated for the
3231 structure pointed by @code{s}, i.e., internal structures are also freed.
3232
3233 @node Cloning
3234 @subsection Cloning: osl_@emph{structure}_clone
3235
3236 @example
3237 @group
3238 osl_@emph{structure}_p osl_@emph{structure}_clone(osl_@emph{structure}_p s);
3239 @end group
3240 @end example
3241
3242 @noindent Each OpenScop data structure has a clone function
3243 as shown above. It recursively copies the content of the
3244 structure pointed by @code{s}, i.e., internal structures are also copied.
3245 It returns a pointer to the clone of the structure pointed by @code{s}.
3246
3247 @node Testing
3248 @subsection Testing: osl_@emph{structure}_equal
3249
3250 @example
3251 @group
3252 int osl_@emph{structure}_equal(osl_@emph{structure}_p s1, osl_@emph{structure}_p s2);
3253 @end group
3254 @end example
3255
3256 @noindent Each OpenScop data structure has a testing function
3257 as shown above. It checks whether two pointers are referring to equivalent
3258 structures (either by pointing to the same structure or to different
3259 structures which contain the same information). It returns 1 if the
3260 pointed structures are equivalent, 0 otherwise. This test is
3261 @emph{content-based} and is intended for debugging purpose. It is not
3262 (and will never be) able to state, e.g., that two relations with
3263 different constraint matrices are actually representing the same relation.
3264
3265
3266 @node Example of OpenScop Library Utilization
3267 @section Example of OpenScop Library Utilization
3268 Here is a basic example showing how it is possible to use the
3269 OpenScop Library, assuming that a standard installation has been done.
3270 The following C program reads an OpenScop file from the standard
3271 input and dumps the content of the data structures to the standard output.
3272
3273 @example
3274 /* example.c */
3275 # include <stdio.h>
3276 # include <osl/osl.h>
3277
3278 int main() @{
3279   osl_scop_p scop;
3280
3281   // Read the OpenScop file.
3282   scop = osl_scop_read(stdin);
3283
3284   // Dump the content of the scop data structure.
3285   osl_scop_dump(stdout, scop);
3286
3287   // Save the planet.
3288   osl_scop_free(scop);
3289
3290   return 0;
3291 @}
3292 @end example
3293
3294 @noindent The compilation command could be:
3295 @example
3296 gcc example.c -losl -o example
3297 @end example
3298 @noindent A calling command with the input file test.scop could be:
3299 @example
3300 more test.scop | ./example
3301 @end example
3302
3303
3304 @c %  ****************************** INSTALLING ********************************
3305 @node Installation
3306 @section Installation
3307
3308 @menu
3309 * License::
3310 * Requirements::
3311 * Installation Instructions::
3312 * Optional Features::
3313 * Uninstallation::
3314 @end menu
3315
3316 @node License
3317 @subsection License
3318 First of all, it would be very kind to refer the present document in any
3319 publication that results from the use of the OpenScop specification or library,
3320 @pxref{Bas11} (a bibtex entry is provided behind the title page of this
3321 manual, along with the copyright notice).
3322 The OpenScop Library is provided under the 3-clause BSD license:
3323
3324 Copyright (C) 2011 University Paris-Sud 11 and INRIA
3325
3326 Redistribution and use in source and binary forms, with or without
3327 modification, are permitted provided that the following conditions
3328 are met:
3329 @enumerate
3330 @item Redistributions of source code must retain the above copyright notice,
3331       this list of conditions and the following disclaimer.
3332 @item Redistributions in binary form must reproduce the above copyrigh
3333       notice, this list of conditions and the following disclaimer in the
3334       documentation and/or other materials provided with the distribution.
3335 @item The name of the author may not be used to endorse or promote products
3336       derived from this software without specific prior written permission
3337 @end enumerate
3338
3339 THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR
3340 IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES
3341 OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED.
3342 IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT,
3343 INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT
3344 NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
3345 DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
3346 THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
3347 (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF
3348 THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
3349
3350 @node Requirements
3351 @subsection Requirements
3352
3353 The OpenScop Library is a stand-alone library. For a basic use,
3354 it does not need any additional tool or library. Anyway, to be able to
3355 work in conjunction with other tools that manipulate multiple precision
3356 numbers, the GNU GMP library can be used as an option.
3357
3358 @menu
3359 * GMP Library::
3360 @end menu
3361
3362
3363 @node GMP Library
3364 @subsubsection GMP Library (optional)
3365
3366 To be able to deal with insanely large coefficient, the user will need to
3367 install the GNU Multiple Precision Library (GMP for short) version 4.2.2
3368 or above@footnote{@code{http://www.swox.com/gmp}}.
3369 The user can compile it by typing the following commands on the GMP root
3370 directory:
3371
3372 @itemize @bullet
3373 @item @code{./configure}
3374 @item @code{make}
3375 @item And as root: @code{make install}
3376 @end itemize
3377
3378 The GMP default installation is @code{/usr/local}. This directory may
3379 not be inside the user's library path. To fix the problem, the user should set
3380 @example
3381 export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib
3382 @end example
3383 @noindent if your shell is, e.g., bash or
3384 @example
3385 setenv LD_LIBRARY_PATH $LD_LIBRARY_PATH:/usr/local/lib
3386 @end example
3387 @noindent if your shell is, e.g., tcsh. Add the line to your .bashrc or .tcshrc (or
3388 whatever convenient file) to make this change permanent. Another solution
3389 is to ask GMP to install in the standard path by using the prefix
3390 option of the configure script:
3391 @samp{./configure --prefix=/usr}.
3392
3393 The OpenScop Library has to be built using the GMP library by specifying
3394 the convenient configure script options to buid the GMP version
3395 (@pxref{Optional Features}).
3396
3397
3398 @node Installation Instructions
3399 @subsection Installation Instructions
3400
3401 Once downloaded and unpacked
3402 (e.g. using the @samp{tar -zxvf openscop-@value{LIB_VERSION}.tar.gz} command),
3403 you can compile the OpenScop Library by typing the following commands
3404 on the OpenScop Library's root directory:
3405
3406 @itemize @bullet
3407 @item @code{./autogen.sh}
3408 @item @code{./configure}
3409 @item @code{make}
3410 @item And as root: @code{make install}
3411 @end itemize
3412
3413 The program binaries and object files can be removed from the
3414 source code directory by typing @code{make clean}. To also remove the
3415 files that the @code{configure} script created (so you can compile the
3416 package for a different kind of computer) type @code{make distclean}.
3417
3418 @node Optional Features
3419 @subsection Optional Features
3420 The @code{configure} shell script attempts to guess correct values for
3421 various system-dependent variables and user options used during compilation.
3422 It uses those values to create the @code{Makefile}. Various user options
3423 are provided by the OpenScop Library's configure script. They are summarized in the
3424 following list and may be printed by typing @code{./configure --help} in the
3425 OpenScop Library top-level directory.
3426
3427 @itemize @bullet
3428 @item By default, the installation directory is @code{/usr/local}:
3429 @code{make install} will install the package's files in
3430 @code{/usr/local/bin}, @code{/usr/local/lib} and @code{/usr/local/include}.
3431 The user can specify an installation prefix other than @code{/usr/local} by
3432 giving @code{configure} the option @code{--prefix=PATH}.
3433
3434 @item By default, The OpenScop Library supports 32 bits, 64 bits and GMP if
3435 it is installed in the standard locations. Using the @code{--with-gmp} option
3436 of @code{configure} the user can specify that no GMP (@code{--with-gmp=no}),
3437 a previously installed (@code{--with-gmp=system}, the default) GMP or a
3438 build GMP (@code{--with-gmp=build}) GMP should be used.
3439 In case of an installed GMP, the installation location can be specified
3440 using the @code{--with-isl-prefix=PATH} and if different, the installation
3441 of the library can be specified using the
3442 @code{--with-isl-exec-prefix=PATH} options of @code{configure}.
3443 In the case of a build GMP, the user can also specify the build location
3444 using @code{--with-isl-builddir=PATH}.
3445 @end itemize
3446
3447 @node Uninstallation
3448 @subsection Uninstallation
3449 The user can easily remove the OpenScop Library from his system
3450 by typing (as root if necessary) from the OpenScop Library top-level
3451 directory
3452 @code{make uninstall}.
3453
3454 @c %  **************************** DOCUMENTATION ******************************
3455 @node Documentation
3456 @section Documentation
3457 The OpenScop Library distribution provides several sources of documentation.
3458 First, the source code itself is as documented as much as possible.
3459 The code comments use the Doxygen technical documentation system.
3460 The user may install
3461 Doxygen@footnote{@code{http://www.stack.nl/~dimitri/doxygen}} to automatically
3462 generate a technical documentation by typing @code{make doc} or
3463 @code{doxygen ./autoconf/Doxyfile} at the OpenScop Library
3464 top-level directory after running the configure script
3465 (@pxref{Installation Instructions}). Doxygen will generate
3466 documentation sources (in HTML, LaTeX and man) in the @code{doc/source}
3467 directory of the OpenScop Library distribution.
3468
3469 The Texinfo source of the present document is also provided in the @code{doc}
3470 directory. The user can build it in either PDF format
3471 (by typing @code{texi2pdf openscop.texi}) or HTML format
3472 (by typing @code{makeinfo --html openscop.texi}, using @code{--no-split}
3473 option to generate a single HTML file) or info format
3474 (by typing @code{makeinfo openscop.texi}).
3475
3476 @c %  **************************** DEVELOPPING ********************************
3477 @node Development
3478 @section Development
3479
3480 @menu
3481 * Copyright Issue::
3482 * Repository::
3483 * Coding Style::
3484 * Extension Development::
3485 @end menu
3486
3487 @node Copyright Issue
3488 @subsection Copyright Issue
3489
3490 The OpenScop Library is an Open Source project and you should feel free to
3491 contribute by adding functionalities (in particular extensions), correcting
3492 bugs or improving documentation. However, for painful administrative reasons,
3493 the copyright of the core part (everything except extensions) should not be
3494 impacted by your work. Hence, if you are doing a significant contribution to
3495 the main part, the OpenScop Library maintainer may ask you for an agreement
3496 about this copyright. If you plan to do such a significant contribution, it
3497 may be wise to discuss this issue with the maintainer first. Extensions
3498 may include developer's own copyright.
3499
3500 @node Repository
3501 @subsection Repository
3502
3503 The main repository of the OpenScop Library is
3504 @url{http://repo.or.cz/w/openscop.git}. Developers may ask the OpenScop Library
3505 maintainer to open them a write access to this repository. Only the maintainer
3506 should ever change the @code{master} branch. Developers should work on their
3507 own branches. To avoid any problem developers should use the @emph{fork}
3508 functionality of the repository.
3509
3510 @node Coding Style
3511 @subsection Coding Style
3512
3513 The OpenScop Library is written in C using an object oriented style. Each
3514 important data structure (e.g., @code{struct foo}) has its own header file
3515 (@code{include/osl/foo.h}) where lies the definition of
3516 the data structure, the two typedefs for the data structure (one for the
3517 structure, @code{osl_foo_t}, and one for a pointer
3518 to the structure, @code{osl_foo_p}), the prototypes of the various
3519 functions related to this data structure, all named using the
3520 prefix "@code{osl_foo_}". The source code of the functions is provided in a
3521 separated C file (@code{source/foo.c}).
3522
3523 Utility functions independent from the main data structures may be placed in
3524 separate source files (e.g., definition in @code{include/osl/util.h}
3525 and code in @code{source/util.c}). Tool-wide preprocessor directives are
3526 placed in @code{include/osl/macros.h}, macros are prefixed with
3527 "@code{OSL_}".
3528
3529 The core code itself has to be written according to the Google C++ Coding Style:
3530 @url{http://google-styleguide.googlecode.com/svn/trunk/cppguide.xml} (for
3531 what can apply to C), plus the naming conventions discussed above with
3532 highest priority. The extension parts must only respect the naming convention,
3533 but a consistent coding style is much appreciated.
3534
3535 @node Extension Development
3536 @subsection Extension Development
3537
3538 It's fairly easy to integrate a new extension to OpenScop and the OpenScop
3539 Library. Developing a new extension is very much like adding a new "object":
3540 it requires writing a data structure for the extension data and the 7 base
3541 functions to manage this extension. Here is how developers should proceed
3542 to add an extension called @code{foo} (beware that the naming convention is
3543 strict):
3544
3545 @enumerate
3546 @item Send the name @code{foo} to the maintainer to ensure it is unique and
3547       hence can be used as an URI. The name (one single
3548       word, or words separated with underscores "_") should be
3549       suggested by the extension developers to the OpenScop development
3550       mailing list @email{openscop-development@@googlegroups.com}). It
3551       should not correspond to an existing structure name
3552       (see @code{include/osl/osl.h} for the list). The
3553       maintainer will update @code{include/osl/osl.h} in the development
3554       version accordingly.
3555 @item Look at the @code{comment} extension. The @code{comment} extension
3556       (@pxref{Comment Extension}) has been written to be used as a basic
3557       example for extension developers. Having a look at
3558       @code{include/osl/extensions/comment.h} and
3559       @code{source/extensions/comment.c} will be a great help to do it right.
3560 @item Create the extension data structure: it is necessary that the
3561       extension data can be accessible through only one pointer.
3562 @item Code the 7 base functions for the extension. Any extension must provide
3563       this set of functions (naming convention apply only if the
3564       extension is planed to be integrated to the OpenScop Library
3565       default extensions):
3566       @itemize @bullet
3567       @item @code{osl_foo_idump} (@pxref{Dumping})
3568       @item @code{osl_foo_sprint} has the following prototype:
3569 @example
3570 @group
3571 char * osl_@emph{structure}_sprint(osl_@emph{structure}_p s);
3572 @end group
3573 @end example
3574            It corresponds to the pretty printing functions of the core
3575            data structures (@pxref{Printing}) with the
3576            difference that the OpenScop textual representation is written
3577            to a string (returned by the function) instead of a file.
3578       @item @code{osl_foo_sread} has the following prototype:
3579 @example
3580 @group
3581 osl_@emph{structure}_p osl_@emph{structure}_sread(char ** string);
3582 @end group
3583 @end example
3584            It corresponds to the reading functions of the core
3585            data structures (@pxref{Reading}) with the
3586            difference that the OpenScop textual representation is read
3587            from a string (provided as a parameter) instead of a file.
3588            The address of the string to read is passed as a parameter and
3589            is updated to point immediately after what has been actually read.
3590       @item @code{osl_foo_malloc} (@pxref{Allocating})
3591       @item @code{osl_foo_free} (@pxref{Deallocating})
3592       @item @code{osl_foo_clone} (@pxref{Cloning})
3593       @item @code{osl_foo_equal} (@pxref{Testing})
3594       @end itemize
3595 @item Code the other functions you need!
3596 @end enumerate
3597
3598 Now let us consider two scenarios.
3599
3600 First scenario, the extension is external and is
3601 not planned to be integrated to the OpenScop Library. In this case you are
3602 all set. Simply generate an @code{osl_interface_t} for your
3603 extension and have a look at the function
3604 @code{osl_scop_register_extension()} which is devoted to register
3605 a new extension interface to an existing @code{osl_scop_t}.
3606
3607 Second scenario, the extension will integrate the set of default
3608 OpenScop Library extensions (the best solution to share it to other
3609 potential users). In this case, a few additional
3610 things have to be done:
3611 @enumerate
3612 @item Create the extension header
3613       @code{include/osl/extensions/foo.h} to store the extension
3614       structure and function prototypes and the
3615       extension source file @code{source/extensions/foo.c} for the code
3616       of the functions.
3617 @item Add the documentation for the extension to the texinfo source of
3618       this document (in @code{doc/openscop.texi}), following the example
3619       of the @code{comment} documentation (@pxref{Comment Extension}).
3620 @item Integrate the extension by adding the @code{extensions/foo.c} entry
3621       to the @code{libosl_la_SOURCES} in the @code{source/Makefile.am}
3622       file, the @code{osl/foo.h} entry to the
3623       @code{nobase_pkginclude_HEADERS} and add the corresponding
3624       @code{#include <osl/extensions/foo.h>} in the
3625       @code{include/scop.h.in} file.
3626 @item Add the new extension in the
3627       @code{osl_interface_get_default_registry()} function.
3628 @item You are done! Prepare a single commit or patch corresponding to the
3629       integration of the new extension and ask the maintainer to merge it
3630       to the master branch.
3631 @end enumerate
3632
3633
3634 @c %  ****************************** REFERENCES ********************************
3635 @node References
3636 @chapter References
3637
3638 @itemize
3639 @item
3640 @anchor{Bas03a}[Bas03a] C. Bastoul, P. Feautrier. Improving data locality
3641 by chunking. CC'12 International Conference on Compiler Construction,
3642 LNCS 2622, pages 320-335, Warsaw, April 2003.
3643
3644 @item
3645 @anchor{Bas11}[Bas11] C. Bastoul.
3646 OpenScop: A Specification and a Library for Data Exchange in Polyhedral
3647 Compilation Tools. Technical Report, Paris-Sud University, France, June 2011.
3648
3649 @item
3650 @anchor{Fea92}[Fea92] P. Feautrier. Some efficient solutions to the affine
3651 scheduling problem, part II: multidimensional time.
3652 International Journal of Parallel Programming, 21(6):389--420, December 1992.
3653
3654 @item
3655 @anchor{Gri04}[Gri04] M. Griebl. Automatic parallelization of loop programs
3656 for distributed memory architectures. Habilitation Thesis. Facult@"at f@"ur
3657 Mathematik und Informatik, Universit@"at Passau, 2004.
3658 @emph{http://www.infosun.fmi.uni-passau.de/cl/loopo/}
3659
3660 @item
3661 @anchor{Wil93}[Wil93] Doran K. Wilde.
3662 A library for doing polyhedral operations.
3663 Technical Report 785, IRISA, Rennes, France, 1993.
3664
3665 @end itemize
3666
3667
3668
3669
3670 @c % /*************************************************************************
3671 @c %  *                       PART VI: END OF THE DOCUMENT                    *
3672 @c %  *************************************************************************/
3673 @c @unnumbered Index
3674
3675 @c @printindex cp
3676
3677 @bye