Official feedback on OpenGL 4.0 thread

Something like 3dsetup would indeed be a good short-term solution to provide people with modern OpenGL drivers. Too bad that Intel isn’t shipping any GL 3.x, let alone 4.x, drivers at this point :frowning:

Another thing that is VERY related to this: a lot of developers encounter OpenGL driver bugs. Some implementations are buggier than others, which frustrates a lot of developers and is also one of the reasons some companies don’t use OpenGL at this point.

In order to improve OpenGL driver quality, I would urge developers, when they encounter problems, to submit test cases to ‘piglit’, an OpenGL testing framework. At least the open source OpenGL drivers are using it as a test bed, but nothing prevents OSX/Windows developers from using it as well.
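For anyone curious what such a test case looks like: piglit’s shader_runner accepts small declarative `.shader_test` files. The following is a hedged sketch of the format (the shaders and expected color here are invented for illustration, not taken from an actual piglit test):

```
[require]
GLSL >= 1.10

[vertex shader]
void main() { gl_Position = gl_Vertex; }

[fragment shader]
void main() { gl_FragColor = vec4(0.0, 1.0, 0.0, 1.0); }

[test]
draw rect -1 -1 2 2
probe all rgba 0.0 1.0 0.0 1.0
```

A file like this reproduces a bug in a few lines and can be rerun automatically against every driver release.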


That’s too counter-productive, imho. Scenes have 5k+ objects visible, each with different textures, and fewer programs. It makes more sense to group by program, imho.
In case you meant binding all N textures for the given mesh instance at once, I don’t think that’s viable either: those GLuint names are not optimally mappable during shader execution (they are not pointers, and should not be).


What kind of GL app are you working on?

Yeah, it’s wishful thinking on my part. But hey, imagine an Intel spokesperson announcing that, “we are planning to ship OpenGL 3.3 support by the end of May!”

Of course, reality kicks in soon, when someone asks for the 100th time how to perform offscreen rendering:

  • User trying to create an invisible window or other broken hacks: “Why do I keep getting garbage back?”
  • Linking to the OpenGL FAQ: “You are not passing the pixel ownership test. Use FBOs.”
  • “But FBOs don’t work on Intel.”
  • “Try pbuffers.”
  • “Nope, no pbuffers either.”
  • “How about a sacrifice to Kthulu?”
  • “Wha…?”
  • “Just kidding. Software rendering for you.”

So much for “high performance graphics” on Intel…
Meanwhile I’ve been quite happily using SetRenderTarget with D3D9 on Intel chips going back to the 915 without a problem.

The annoying thing is that the hardware actually does support hardware accelerated offscreen rendering perfectly well.
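For reference, the FAQ answer above (“use FBOs”) really does boil down to a short setup sequence. Here is a sketch of the canonical render-to-texture setup; the gl* entry points are minimal stand-ins so the sketch compiles and runs without a live context, but the call sequence in `create_offscreen_target` is the same one you would issue against a real GL 3.x driver:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

typedef unsigned int GLuint;
typedef unsigned int GLenum;
typedef int GLint;
typedef int GLsizei;

enum : GLenum {
    GL_FRAMEBUFFER          = 0x8D40,
    GL_COLOR_ATTACHMENT0    = 0x8CE0,
    GL_TEXTURE_2D           = 0x0DE1,
    GL_RGBA8                = 0x8058,
    GL_RGBA                 = 0x1908,
    GL_UNSIGNED_BYTE        = 0x1401,
    GL_FRAMEBUFFER_COMPLETE = 0x8CD5,
};

// --- stand-ins: record the call sequence instead of talking to a driver ---
static std::vector<const char*> g_calls;
static GLuint g_next_name = 1;
void glGenTextures(GLsizei, GLuint* ids)     { *ids = g_next_name++; g_calls.push_back("GenTextures"); }
void glGenFramebuffers(GLsizei, GLuint* ids) { *ids = g_next_name++; g_calls.push_back("GenFramebuffers"); }
void glBindTexture(GLenum, GLuint)           { g_calls.push_back("BindTexture"); }
void glBindFramebuffer(GLenum, GLuint)       { g_calls.push_back("BindFramebuffer"); }
void glTexImage2D(GLenum, GLint, GLint, GLsizei, GLsizei, GLint, GLenum, GLenum, const void*) {
    g_calls.push_back("TexImage2D");
}
void glFramebufferTexture2D(GLenum, GLenum, GLenum, GLuint, GLint) {
    g_calls.push_back("FramebufferTexture2D");
}
GLenum glCheckFramebufferStatus(GLenum)      { return GL_FRAMEBUFFER_COMPLETE; }

// Render-to-texture without any window involvement, so the pixel
// ownership test from the FAQ never applies.
GLuint create_offscreen_target(GLsizei w, GLsizei h) {
    GLuint fbo = 0, color = 0;
    glGenTextures(1, &color);
    glBindTexture(GL_TEXTURE_2D, color);
    glTexImage2D(GL_TEXTURE_2D, 0, GL_RGBA8, w, h, 0, GL_RGBA, GL_UNSIGNED_BYTE, nullptr);
    glGenFramebuffers(1, &fbo);
    glBindFramebuffer(GL_FRAMEBUFFER, fbo);
    glFramebufferTexture2D(GL_FRAMEBUFFER, GL_COLOR_ATTACHMENT0, GL_TEXTURE_2D, color, 0);
    assert(glCheckFramebufferStatus(GL_FRAMEBUFFER) == GL_FRAMEBUFFER_COMPLETE);
    return fbo;
}
```

Six calls plus a completeness check; nothing exotic, which is what makes the driver situation so frustrating.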

In section 1.2.1 of the GLSL 3.3 spec (Summary of Changes from Version 1.50) it says
“Added Appendix A to describe include tree and path semantics/syntax for both the language and the API specifications.”
Neither this appendix nor any related information appears in the GLSL spec or the GL 3.3 spec. The related extension (ARB_shading_language_include) says
“We decided not to put #include into OpenGL 3.3 / 4.0 yet”

In a word, “SWEET!” I love the new direction the ARB has taken with OpenGL! Keep it coming.

BTW, as for setup of GL 4.0: I haven’t read the spec, but I am assuming its setup and usage are no different from GL 3.2?


Great step forward! I just hope drivers will implement all these features reliably. A spec conformance test suite (à la the ACID tests for browsers) would be extremely useful for this.

I know at least six developers at my co. that want the ability to separate shader objects and to have a binary shader format. Maybe the shader subroutines will help, depending on their performance.

DSA would be a nice-to-have, but not imperative, since we wrapped all the object binding logic in classes.

Command lists, as BarnacleJunior suggested, would also be very useful. They would allow maximum efficiency in the OpenGL draw thread, since it would only execute a compiled list of OpenGL commands; kind of like a display list for each frame or each part of a frame.
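The command-list idea above can be sketched in a few lines. This is a hypothetical illustration, not anything in GL 4.0: the application threads record callables once, and the draw thread only replays the pre-built list (in a real renderer the recorded commands would be already-validated state changes and draw calls):

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

class CommandList {
public:
    // Record any callable; validation and state setup happen here,
    // once, on whichever thread builds the list.
    void record(std::function<void()> cmd) { cmds_.push_back(std::move(cmd)); }

    // The draw thread only does this: run straight through the list,
    // like replaying a per-frame display list.
    void execute() const {
        for (const auto& c : cmds_) c();
    }

    std::size_t size() const { return cmds_.size(); }

private:
    std::vector<std::function<void()>> cmds_;
};
```

A list built once can be executed every frame until the scene changes, which is exactly the efficiency argument made above.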

I agree we should look to the future now. OpenGL vs. DX9 on DX9-class hardware (most Intel integrated stuff) was a battle clearly lost: FBO came too late, and GLSL was also a bit dodgy compared to DX9 SM3 (and even compared to the ARB program extensions).
So one should not try to fix up the past; that’s just too much legacy, and not worth the effort. But for SM4+ hardware things look different now, with both APIs very close feature-wise, and one of them able to expose that functionality on all platforms, including Windows XP.

I am not sure what the mobile guys are working on, but given the lean nature of the “core” profiles, I would think that GL ES might not be needed anymore for the next-gen mobile stuff.

Out of curiosity, is there a clear benefit for the IHVs in the “link” mechanism GLSL has (vs. the DX-like individual shaders)? In theory additional optimization could be done, but is this really being made use of?

To all those requesting DSA: write wrapper classes for OpenGL resources and you have DSA. Works great when done well.

To all those requesting anything from OpenGL: write your own software renderer and you have it. Works great when done well.

The binding/state system has no benefits. It is a minor problem for IHVs and a major problem for game programmers.
It was especially awful in the FF days, where every routine had to look like:


to ensure that 2 different modules wouldn’t overwrite each other’s settings. Now with shaders (no state machine there!) and VAOs and other fancy stuff there aren’t as many “binding places”, but, for example, binding textures (now with additional sampler objects) and UBOs is still cumbersome.

Fewer API calls == better performance == profit.
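The usual way to cut those calls today is a CPU-side cache of the binding points: skip the bind when the requested texture is already bound to that unit. A minimal sketch (the bind function is injected here so the sketch runs without a GL context; in real code it would issue glActiveTexture + glBindTexture):

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

class TextureBindCache {
public:
    TextureBindCache(std::size_t units,
                     std::function<void(unsigned, unsigned)> bind)
        : bound_(units, 0), bind_(std::move(bind)) {}

    void bindTexture(unsigned unit, unsigned tex) {
        if (bound_[unit] == tex) return;  // redundant bind: no API call issued
        bound_[unit] = tex;
        bind_(unit, tex);
    }

private:
    std::vector<unsigned> bound_;  // 0 = nothing bound to that unit
    std::function<void(unsigned, unsigned)> bind_;
};
```

The catch, of course, is that every module in the app has to go through the cache; one raw glBindTexture elsewhere and the shadowed state is wrong.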

Randall, do you know what DSA means? The point is that…

The intent of this extension is to make it more efficient for libraries to avoid disturbing selector and latched state.
… and you suggest making a wrapper…

Glad to hear GL spec 4.0 is out.

“Functional” Drivers?

Intel graphics?

Are extensions like:

going to be modified to remove the ARB suffix from tokens + entry points? (Otherwise the headers will need to include all of these again, without the ARB suffix.)

Can you elaborate on “dropped” ?

DX11 cards still need to be able to run DX9/DX10 software, so I don’t see how this feature could be cut from silicon unless it has simply become another programmable behavior masquerading as FF behavior… or do you mean that it’s just not in the DX11 API any more.

I blame my bad memory for making me think that some additional restrictions introduced in DX10.1 meant that dual source blending is getting the shaft. :o

Well said, Aleksandar. DSA is about making the API more streamlined and EFFICIENT. Sure, if you use glGet* and push/pop before EVERY state change, you can make it work the same way, even today.

But then don’t complain about slow rendering. Multi-threading is then completely impossible for the driver to accomplish.


The binding/state system has no benefits.

That’s not why it’s still around.

Fewer API calls == better performance == profit.

That’s not necessarily true. It can be true, but it certainly doesn’t have to be.

going to be modified to remove ARB suffix from tokens + entry points?

They didn’t do it when ARB_geometry_shader4 was promoted to core, so I doubt they’ll start now.

Core extensions (ARB extensions without the suffix) are something of a nicety. They aren’t 100% necessary, but they’re nice to have when possible. It certainly isn’t worth rewriting an extension specification just to have them, though.

Multi-threading is then completely impossible for the driver to accomplish.

This is probably the best argument for DSA. You can’t have multithreaded rendering without it.

However, the problem is that, even if you use DSA, backwards compatibility means that you don’t have to. What then happens to multithreaded rendering in that case? Does the spec just say, “attempting to call functions X, Y, Z will cause undefined behavior when threading?”

Yes, I know what DSA means. I was not talking about using glGet* and push/pop before EVERY state-change. I know that this would kill performance. I was talking about caching the most important state in app on the CPU side (tracking binding points etc.)

I agree that DSA would be nicer and more efficient. But the reality is that we don’t have it in OpenGL 4.0.

I suggested creating a thin layer (a wrapper for OpenGL resources) which would “emulate DSA” for non-NVIDIA hardware and use the fast path (DSA) on NV hardware. I have written such an abstraction and it works well. So, do not complain. Be happy with OpenGL 4.0. It’s getting better and better.
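The thin layer described above can be sketched roughly like this. It is only an illustration under assumptions (the class and member names are invented): one entry point that takes an EXT_direct_state_access-style path when the driver exposes it, and otherwise emulates it with bind + modify. Function pointers stand in for the real GL entry points so the sketch runs on its own:

```cpp
#include <cassert>
#include <functional>

struct GLFuncs {
    // DSA path, e.g. glTextureParameteriEXT(tex, target, pname, param);
    // left empty when the extension is unsupported.
    std::function<void(unsigned, unsigned, unsigned, int)> textureParameteri;
    // Classic bind-to-edit path.
    std::function<void(unsigned, unsigned)> bindTexture;
    std::function<void(unsigned, unsigned, int)> texParameteri;
};

class Texture {
public:
    Texture(unsigned name, unsigned target, GLFuncs* gl)
        : name_(name), target_(target), gl_(gl) {}

    void setParameter(unsigned pname, int value) {
        if (gl_->textureParameteri) {
            // Fast path: direct state access, no binding disturbed.
            gl_->textureParameteri(name_, target_, pname, value);
        } else {
            // Emulated DSA: bind, then modify through the selector.
            gl_->bindTexture(target_, name_);
            gl_->texParameteri(target_, pname, value);
        }
    }

private:
    unsigned name_, target_;
    GLFuncs* gl_;
};
```

Callers only ever see `setParameter`, so the app code stays DSA-shaped regardless of which path the driver actually takes.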

Very pleasantly surprised by this OpenGL release.
Love the new stuff.

The drawing without cpu intervention is fantastic!
This saves a lot of valuable cpu-cycles.
Makes OpenGL very efficient :slight_smile:
Good to see instancing going further.
The timer query stuff is going to be really handy.
It makes it possible for programs to run a mini-benchmark.
Add to this the new shader subroutine flexibility.
It’s going to be possible to write programs that might optimize themselves dynamically at runtime. :smiley: :slight_smile:
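That runtime self-optimization idea maps directly onto the new shader subroutines. A hedged GLSL 4.00 sketch (the names `ShadeModel`, `shadeCheap`, `shadeFancy`, and the shading math are all invented for illustration):

```glsl
#version 400

// One subroutine type, two interchangeable implementations.
subroutine vec4 ShadeModel(vec3 n, vec3 l);

subroutine(ShadeModel) vec4 shadeCheap(vec3 n, vec3 l) {
    return vec4(max(dot(n, l), 0.0));
}

subroutine(ShadeModel) vec4 shadeFancy(vec3 n, vec3 l) {
    float d = max(dot(n, l), 0.0);
    return vec4(pow(d, 8.0) + d * 0.5);
}

// The app selects which implementation this calls, per draw.
subroutine uniform ShadeModel shade;

in vec3 normal;
in vec3 lightDir;
out vec4 color;

void main() {
    color = shade(normalize(normal), normalize(lightDir));
}
```

The app queries indices with glGetSubroutineIndex and switches between them with glUniformSubroutinesuiv, with no relink: so after a timer-query mini-benchmark it could drop from `shadeFancy` to `shadeCheap` on slow hardware at runtime.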

The only thing that’s missing is DSA.
+1 for the DSA.

This is a very good release with a lot of nice goodies.
Khronos is really improving OpenGL very well. Kudos for that.

I’m sorry, randall, for the misunderstanding.
And, of course, I’m happy with both OpenGL and NV. :wink:

The drawing without cpu intervention is fantastic!
This saves a lot of valuable cpu-cycles.

It’s not there to save performance. What it does do is allow a shader that does transform feedback to decide by itself how to do the rendering with the feedback data.

Good to see instancing going further.

I was actually rather surprised to see them put that form of instancing back in the rendering pipeline. Especially since D3D took it out in version 10 (as I understand it).

The only thing that’s missing is DSA.

*cough* shader separation *cough*.

I’m not using tessellation until I can runtime mix and match shaders as I see fit without having to re-link and everything.