Rawhide Watch

Daily warnings for rawhide victims

GNOME > System > Shutdown fails to Shutdown?

Posted by rawhidewatch on April 17, 2009

In the past week I’ve noticed the following strange behavior on multiple Rawhide machines.

  • I am logged into the GNOME desktop.
  • I select System > Shutdown > Shutdown
  • X dies, the shutdown animation happens, but it just stops.
  • Ctrl-Alt-F2 switches to a VT, but typing does nothing because the getty is dead.
  • Ctrl-Alt-Delete starts the reboot process, where we see things actually go through the shutdown process with services stopping.

Shutdown from GDM, or shutdown immediately after logging into GNOME do not exhibit this behavior.  I am told that Bug #495326 is related to this issue.

Posted in Uncategorized | Comments Off

ext4 on Intel X25-M SSD needs nodelalloc

Posted by rawhidewatch on April 13, 2009

Many early SSD owners paid hundreds of dollars for a solid state drive disk, only to realize that performance is terrible with “stuttering” caused by terrible SSD controllers made by JMicron and its inability to handle random write patterns.  Intel’s X25-M and 25-E drives have consistently beat all other consumer grade SSD drives due to its superbly designed SSD controller, but at a huge price premium.  Generally users of Intel’s SSD’s have avoided the “stuttering” problems of lesser quality drives.

Unfortunately, it seems that 15-60 seconds of seemingly deadlock-like non-responsiveness can happen even with an Intel X25-M when using ext4 filesystem.  Ever since I upgraded my Thinkpad T60 laptop to an Intel X25-M 160GB drive, my entire system would feel like it freezed, but then I notice the hard drive light flashing intermittently.  After maybe 30-60 seconds the hard drive light would be solid on for a few seconds, then the system would recover.  I cannot figure out how to reproduce this on-demand, but it seems to happen 2-3 times a day, and only during times where disk activity is very low.

In any case, this problem goes away entirely if I add “nodelalloc” to the mount flags of my ext4 filesystems.  This turns off ext4 delayed allocation, which can typically delay writing to the disk for longer periods than ext3′s 5 seconds, perhaps 30-60 seconds.  This is a benefit to hard drives because you can save power by spinning up less often.  But for Intel”s X25-M drives, you are better off writing as early and often as possible since the drive doesn’t spin (no power benefit), and it reportedly handles parallelism internally very well, meaning the OS shouldn’t worry about it because there is no performance benefit.

In related news, it was suggested to me that elevator=noop is useful in your kernel cmdline in grub.conf if your disks are SSD.  While I am personally trying it, I am uncertain if this is a good thing.  While the elevator normally wouldn’t be necessary with a theoretically perfect solid state disk with constant-time random access to any address,  SSD’s are not perfect with constant time access.  Especially in the case of writing many small changes to non-contiguous parts of the disk, SSD’s suffer from what is known as write amplification because SSD’s need to read, erase and re-write entire blocks if you change only a tiny portion of that block.  I wonder if write-combining that happens in the elevator prevents a few I/O writes to non-adjacent but nearby blocks from triggering multiple write events to the same SSD block.  If it turns out the elevator does help to minimize write amplification before it gets to the SSD’s controller, then it would be both a performance benefit as well as helping the longevity of the disk.  I have no clue how to measure this.  Perhaps we need Intel engineers + people who know filesystems to answer this.

In any case, this ext4 behavior with Intel X25-M seems to be a bug in ext4?  I don’t know.

UPDATE:
Jeff Moyer pointed out that recent versions of the CFQ elevator has smart SSD detection.  If SSD is detected, it disables idling during random access reads because the penalty for reading is not severe.  He thinks you are better off using the default CFQ scheduler with these drives.  Jeff also pointed out that elevator=noop will also do write combining.

UPDATE April 17th,2009:
Intel released a firmware update for their X25-M and X18-M SSD’s that improves performance in the worst case scenario.  I flashed my X25-M and turned off nodelalloc to see if this makes a difference.  It seems ext4 still behaves badly without nodelalloc with occasional episodes of 15-60 seconds of unresponsiveness.

Posted in Uncategorized | Comments Off

Video with weird delays?

Posted by rawhidewatch on April 10, 2009

F-11 rawhide users might have noticed weird behaviors with video playing in the past months.  Playing videos in Youtube, mplayer or totem would often play with audio immediately, but you see no video for a few seconds.  Then suddenly many video frames rapidly play, then video becomes smooth in sync with the audio.  It turns out that this was actually problems with video syncing to the audio clock of pulseaudio.  The latest build pulseaudio-0.9.15-9+ substantially improves this weird problem at the beginning of videos, and seems to make video playback smoother as well.  Note that if you are using mplayer, you need this patch to be applied to remove a workaround hack, otherwise you might not notice any improvement.  Hopefully distributors of mplayer will patch soon.

Posted in Uncategorized | 1 Comment »

Rawhide x86_64 firefox and rpm broke? Workaround Procedure

Posted by rawhidewatch on April 8, 2009

Some rawhide x86_64 users have hit a problem where firefox and rpm were broken by a dependency problem.  Adam Williamson wrote:


https://bugzilla.redhat.com/show_bug.cgi?id=494122

The nss-3.12.2.99.3-5 package introduced a dependency on nss-softokn-freebl. However, this dependency was not architecture specific: it could be satisfied by the i586 or x86-64 package. Depending on what packages they had installed, x86-64 users could wind up with only the i586 nss-softokn-freebl package, which broke nss, and – in consequence – at least Firefox and also rpm.

The nss-3.12.2.99.3-7 update fixes this by making the dependency arch-specific. If you have been affected by the issue – you’ll know if
you’re running x86-64 Rawhide and your Firefox and rpm don’t work, and  you can verify by checking what nss-softokn-freebl packages are installed – you can get out of the jam by downloading the x86-64 nss, nss-tools and nss-softokn-freebl packages from a Rawhide mirror and running:

cd /
rpm2cpio PATH_TO_RPM/nss-3.12.2.99.3-7.fc11.`uname -m`.rpm | cpio -i
rpm2cpio PATH_TO_RPM/nss-softokn-freebl-3.12.2.99.3-7.fc11.`uname -m`.rpm | cpio -i
rpm2cpio PATH_TO_RPM/nss-tools-3.12.2.99.3-7.fc11.`uname -m`.rpm | cpio -i

Posted in Uncategorized | Comments Off

Broken typing in gnome-terminal tabs and Firefox?

Posted by rawhidewatch on March 30, 2009

Are you seeing any of these weird issues?

  • Firefox: Spacebar no longer scrolls pages down.
  • Firefox: In a HTML text area box like the component selection here in Bugzilla, if I want to choose ibus, I would click in the box and type “ibus”.  Typing however does nothing.
  • gnome-terminal: Open one tab. Type. Open second tab. Type. Go back to the first tab. Typing appears to be broken, although ENTER and Backspace work.  Typing went into the second tab.

This was broken by ibus-1.1.0.20090311-3.fc11 in Bug #492956. Try the latest build of ibus.

Posted in Uncategorized | Comments Off

Beta freeze lifted

Posted by rawhidewatch on March 28, 2009

Jesse Keating just posted that the Beta release has been composed, and thus rawhide will unfreeze.  Tomorrow’s rawhide will have all packages pent up in dist-f11.

Posted in Uncategorized | Comments Off

Workaround for Thunderbird 3 beta huge IMAP folders 100% CPU hangs

Posted by rawhidewatch on March 27, 2009

Upgraded from thunderbird-2.something to 3.0 beta 2. I have very large IMAP SSL folders. Thunderbird seems to get stuck using 100% CPU and becomes entirely unresponsive for minutes at a time, sometimes doesn’t ever recover.


https://wiki.mozilla.org/MailNews:Logging
export NSPR_LOG_MODULES=imap:5
export NSPR_LOG_FILE=/tmp/imap.log

imap.log grew to 700MB in about 20 minutes. Curiously, CPU usage of thunderbird remained low and it did not get “stuck” for extended periods of time.

export NSPR_LOG_MODULES=imap:5
export NSPR_LOG_FILE=/dev/null

This workaround seems to be make Thunderbird usable again. Thunderbird’s UI still freezes for a few seconds, but Thunderbird 2 used to do that too.

Update August 26th, 2009: Thunderbird remains crippled in this regard.  This bug seems to have several people experiencing the problem.   The above workaround definitely did work back in March, but doesn’t seem to be helping now for me.  Instead it seems to help if you uncheck all of your folders for Synchronization.  I personally synchronize only Inbox and Sent and uncheck every other folder.

Posted in Uncategorized | Comments Off

Slow vncviewer? Enable composite

Posted by rawhidewatch on March 26, 2009

Apparently vncviewer is REALLY SLOW, rendering something in 2D that causes X to use 100% CPU with i945 video (possibly other cards too?)

Enable compiz and vncviewer will be suddenly fast again.  Adam Jackson reportedly is working on optimizing intel driver performance.  He says rendering directly to the front buffer is bad right now.  This is being tracked in Bug #490839.

Posted in Uncategorized | Comments Off

Launch of Rawhide Watch

Posted by rawhidewatch on March 24, 2009

This blog tracks daily news about notable things to watch out for if you use rawhide. If something important in rawhide is broken, this blog should warn users about it and suggest workarounds.  During the lull after stable releases, notable things users should know about Fedora updates stable may also be written here.

Please submit notable news to Warren Togami and it may appear here.

Posted in Uncategorized | Comments Off

 
Follow

Get every new post delivered to your Inbox.