FFI: Fix unfused loads of float fields.
[luajit-2.0.git] / doc / status.html
blobd386e1aa5db6a2fa6f906c98a15f74418001d796
1 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
2 <html>
3 <head>
4 <title>Status &amp; Roadmap</title>
5 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
6 <meta name="Author" content="Mike Pall">
7 <meta name="Copyright" content="Copyright (C) 2005-2011, Mike Pall">
8 <meta name="Language" content="en">
9 <link rel="stylesheet" type="text/css" href="bluequad.css" media="screen">
10 <link rel="stylesheet" type="text/css" href="bluequad-print.css" media="print">
11 <style type="text/css">
12 ul li { padding-bottom: 0.3em; }
13 </style>
14 </head>
15 <body>
16 <div id="site">
17 <a href="http://luajit.org"><span>Lua<span id="logo">JIT</span></span></a>
18 </div>
19 <div id="head">
20 <h1>Status &amp; Roadmap</h1>
21 </div>
22 <div id="nav">
23 <ul><li>
24 <a href="luajit.html">LuaJIT</a>
25 <ul><li>
26 <a href="install.html">Installation</a>
27 </li><li>
28 <a href="running.html">Running</a>
29 </li></ul>
30 </li><li>
31 <a href="extensions.html">Extensions</a>
32 <ul><li>
33 <a href="ext_ffi.html">FFI Library</a>
34 <ul><li>
35 <a href="ext_ffi_tutorial.html">FFI Tutorial</a>
36 </li><li>
37 <a href="ext_ffi_api.html">ffi.* API</a>
38 </li><li>
39 <a href="ext_ffi_semantics.html">FFI Semantics</a>
40 </li></ul>
41 </li><li>
42 <a href="ext_jit.html">jit.* Library</a>
43 </li><li>
44 <a href="ext_c_api.html">Lua/C API</a>
45 </li></ul>
46 </li><li>
47 <a class="current" href="status.html">Status</a>
48 <ul><li>
49 <a href="changes.html">Changes</a>
50 </li></ul>
51 </li><li>
52 <a href="faq.html">FAQ</a>
53 </li><li>
54 <a href="http://luajit.org/performance.html">Performance <span class="ext">&raquo;</span></a>
55 </li><li>
56 <a href="http://luajit.org/download.html">Download <span class="ext">&raquo;</span></a>
57 </li></ul>
58 </div>
59 <div id="main">
60 <p>
61 The <span style="color: #0000c0;">LuaJIT 1.x</span> series represents
62 the current <span style="color: #0000c0;">stable branch</span>.
63 Only a single bug has been discovered in the last two years. So, if
64 you need a rock-solid VM, you are encouraged to fetch the latest
65 release of LuaJIT 1.x from the <a href="http://luajit.org/download.html"><span class="ext">&raquo;</span>&nbsp;Download</a>
66 page.
67 </p>
68 <p>
69 <span style="color: #c00000;">LuaJIT 2.0</span> is the currently active
70 <span style="color: #c00000;">development branch</span>.
71 It has <b>Beta Test</b> status and is still undergoing
72 substantial changes.
73 It has <a href="http://luajit.org/performance.html"><span class="ext">&raquo;</span>&nbsp;much better performance</a> than LuaJIT 1.x.
74 It's maturing quickly, so you should definitely
75 start to evaluate it for new projects right now.
76 </p>
78 <h2>Current Status</h2>
79 <p>
80 This is a list of the things you should know about the LuaJIT 2.0 beta test:
81 </p>
82 <ul>
83 <li>
84 Obviously there will be some <b>bugs</b> in a VM which has been
85 rewritten from the ground up. Please report your findings together with
86 the circumstances needed to reproduce the bug. If possible, reduce the
87 problem down to a simple test case.<br>
88 There is no formal bug tracker at the moment. The best place for
89 discussion is the
90 <a href="http://www.lua.org/lua-l.html"><span class="ext">&raquo;</span>&nbsp;Lua mailing list</a>. Of course
91 you may also send your bug reports <a href="contact.html">directly to me</a>,
92 especially when they contain lengthy debug output or if you require
93 confidentiality.
94 </li>
95 <li>
96 The x86 JIT compiler only generates code for CPUs with support for
97 <b>SSE2</b> instructions. I.e. you need at least a P4, Core 2/i3/i5/i7,
98 Atom or K8/K10 to get the full benefit.<br>
99 If you run LuaJIT on older CPUs without SSE2 support, the JIT compiler
100 is disabled and the VM falls back to the LuaJIT interpreter. This is faster
101 than the Lua interpreter, but not nearly as fast as the JIT compiler of course.
102 Run the command line executable without arguments to show the current status
103 (<tt>JIT: ON</tt> or <tt>JIT: OFF</tt>).
104 </li>
105 <li>
106 The VM is complete in the sense that it <b>should</b> run all Lua code
107 just fine. It's considered a serious bug if the VM crashes or produces
108 unexpected results &mdash; please report this. There are only very few
109 known incompatibilities with standard Lua:
110 <ul>
111 <li>
112 The Lua <b>debug API</b> is missing a couple of features (return
113 hooks for non-Lua functions) and shows slightly different behavior
114 (no per-coroutine hooks, no tail call counting).
115 </li>
116 <li>
117 Some of the <b>configuration options</b> of Lua&nbsp;5.1 are not supported:
118 <ul>
119 <li>The <b>number type</b> cannot be changed (it's always a <tt>double</tt>).</li>
120 <li>The stand-alone executable cannot be linked with <b>readline</b>
121 to enable line editing. It's planned to add support for loading it
122 on-demand.</li>
123 </ul>
124 </li>
125 <li>
126 Most other issues you're likely to find (e.g. with the existing test
127 suites) are differences in the <b>implementation-defined</b> behavior.
128 These either have a good reason (like early tail call resolving which
129 may cause differences in error reporting), are arbitrary design choices
130 or are due to quirks in the VM. The latter cases may get fixed if a
131 demonstrable need is shown.
132 </li>
133 </ul>
134 </li>
135 <li>
136 The <b>JIT compiler</b> falls back to the
137 interpreter in some cases. All of this works transparently, so unless
138 you use <tt>-jv</tt>, you'll probably never notice (the interpreter is
139 <a href="http://luajit.org/performance.html"><span class="ext">&raquo;</span>&nbsp;quite fast</a>, too). Here are the known issues:
140 <ul>
141 <li>
142 Most known issues cause a <b>NYI</b> (not yet implemented) trace abort
143 message. E.g. for calls to some internal library
144 functions. Reporting these is only mildly useful, except if you have good
145 example code that shows the problem. Obviously, reports accompanied with
146 a patch to fix the issue are more than welcome. But please check back
147 with me, before writing major improvements, to avoid duplication of
148 effort.
149 </li>
150 <li>
151 The trace compiler currently doesn't back off specialization for
152 function call dispatch. It should really fall back to specializing on
153 the prototype, not the closure identity. This can lead to the so-called
154 "trace explosion" problem with <b>closure-heavy programming</b>. The
155 trace linking heuristics prevent this, but in the worst case this
156 means the code always falls back to the interpreter.
157 </li>
158 <li>
159 <b>Trace management</b> needs more tuning: less drastic countermeasures
160 against trace explosion and better heuristics in general.
161 </li>
162 <li>
163 Some checks are missing in the JIT-compiled code for obscure situations
164 with <b>open upvalues aliasing</b> one of the SSA slots later on (or
165 vice versa). Bonus points, if you can find a real world test case for
166 this.
167 </li>
168 <li>
169 Currently some <b>out-of-memory</b> errors from <b>on-trace code</b> are not
170 handled correctly. The error may fall through an on-trace
171 <tt>pcall</tt> (x86) or it may be passed on to the function set with
172 <tt>lua_atpanic</tt> (x64).
173 </li>
174 </ul>
175 </li>
176 </ul>
178 <h2>Roadmap</h2>
180 Please refer to the
181 <a href="http://lua-users.org/lists/lua-l/2011-01/msg01238.html"><span class="ext">&raquo;</span>&nbsp;LuaJIT
182 Roadmap 2011</a> for the latest release plan. Here's the general
183 project plan for LuaJIT 2.0:
184 </p>
185 <ul>
186 <li>
187 The main goal right now is to stabilize LuaJIT 2.0 and get it out of
188 beta test. <b>Correctness</b> has priority over completeness. This
189 implies the first stable release will certainly NOT compile every
190 library function call and will fall back to the interpreter from time
191 to time. This is perfectly ok, since it still executes all Lua code,
192 just not at the highest possible speed.
193 </li>
194 <li>
195 The next step is to get it to compile more library functions and handle
196 more cases where the compiler currently bails out. This doesn't mean it
197 will compile every corner case. It's much more important that it
198 performs well in a majority of use cases. Every compiler has to make
199 these trade-offs &mdash; <b>completeness</b> just cannot be the
200 overriding goal for a low-footprint, low-overhead JIT compiler.
201 </li>
202 <li>
203 More <b>optimizations</b> will be added in parallel to the last step on
204 an as-needed basis. Sinking of stores
205 to aggregates and sinking of allocations are high on the list.
206 More complex optimizations with less pay-off, such as value-range-propagation
207 (VRP) will have to wait.
208 </li>
209 <li>
210 LuaJIT 2.0 has been designed with <b>portability</b> in mind.
211 Nonetheless, it compiles to native code and needs to be adapted to each
212 architecture. The two major work items are porting the the fast interpreter,
213 which is written in assembler, and porting the compiler backend.
214 Most other portability issues like endianess or 32 vs. 64&nbsp;bit CPUs
215 have already been taken care of.<br>
216 Several ports are already available, thanks to the
217 <a href="http://luajit.org/sponsors.html"><span class="ext">&raquo;</span>&nbsp;LuaJIT sponsorship program</a>.
218 More ports will follow in the future &mdash; companies which are
219 interested in sponsoring a port to a particular architecture, please
220 use the given contact address.
221 </li>
222 <li>
223 <b>Documentation</b> about the <b>internals</b> of LuaJIT is still sorely
224 missing. Although the source code is included and is IMHO well
225 commented, many basic design decisions are in need of an explanation.
226 The rather un-traditional compiler architecture and the many highly
227 optimized data structures are a barrier for outside participation in
228 the development. Alas, as I've repeatedly stated, I'm better at
229 writing code than papers and I'm not in need of any academic merits.
230 Someday I will find the time for it. :-)
231 </li>
232 <li>
233 Producing good code for unbiased branches is a key problem for trace
234 compilers. This is the main cause for "trace explosion".
235 <b>Hyperblock scheduling</b> promises to solve this nicely at the
236 price of a major redesign of the compiler. This would also pave the
237 way for emitting predicated instructions, which is a prerequisite
238 for efficient <b>vectorization</b>.
239 </li>
240 </ul>
241 <br class="flush">
242 </div>
243 <div id="foot">
244 <hr class="hide">
245 Copyright &copy; 2005-2011 Mike Pall
246 <span class="noprint">
247 &middot;
248 <a href="contact.html">Contact</a>
249 </span>
250 </div>
251 </body>
252 </html>