Beagle not finding pdfs, odt, etc... in 11.2

Hello,

I had Beagle up and running under openSUSE 11.0 with no issues. It did a great job of searching file names and the contents of OpenOffice files, PDFs, Thunderbird emails, etc…

I have added the appropriate file locations, I have everything selected to index, AND the service is running.

Beagle isn't finding file names or anything inside PDFs or OpenOffice documents at all.

Any Suggestions?
Thank you for your time in this matter.

qu1nn

anyone?

Thanks

qu1nn

I don’t have this issue in 11.2, x86_64.

However, you could try to run “beagle-extract-content” on one of the pdf files and see if it can find a filter for it.

Second, you could check in /etc/beagle/crawl-rules if pdf is enabled (system wide setup).

Third, you could check in $HOME/.beagle/config for suspicious excludes.
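The first check can be scripted if you want to try several files in one go. This is only a sketch: the helper name `extract_content` is made up for illustration, and it assumes nothing about what the tool prints beyond passing it through.

```python
import shutil
import subprocess


def extract_content(path, tool="beagle-extract-content"):
    """Run the extractor named above on one file.

    Returns the combined stdout/stderr text, or None if the tool
    is not on PATH. The output is not interpreted in any way.
    """
    exe = shutil.which(tool)
    if exe is None:
        return None
    result = subprocess.run([exe, path], capture_output=True, text=True)
    return result.stdout + result.stderr
```

Calling it on one PDF and one ODT file and reading the output by eye should be enough to see whether a filter was picked for each.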

Question: do you use the Thunderbird backend of Beagle? That is the one that doesn’t return any results in my case … :\

Sorry for not responding sooner. Hopefully today I will get back to the Beagle issue I am having.

Searching in Thunderbird works fine. However, how do I tell whether I am ACTUALLY using the Thunderbird backend of Beagle? (I didn't see anything under EDIT > PREFERENCES.)

YaST > Software shows the following installed:
Mozilla Thunderbird (3.0.0-1.1.1)
beagle-thunderbird (0.3.9-9.10.4)

regards
qu1nn

My intent is to search /archive and /media/ebooks,
specifically file names and the contents of files (pdf, odt, xls, doc, txt, etc.)

In the beagle search gui (0.3.9), SEARCH > PREFERENCES:

SEARCHING TAB: all items selected:
Start search & Indexing services automatically
Index while on battery power
index more aggressively while screensaver is activated

INDEXING TAB: Selected to index my home directory and additional:
/home/qu1nn
/archive
/media/ebooks

Privacy :
Path: $HOME/tmp
Path: $BEAGLE_HOME/tmp
Pattern(s): ~, ##, *.o, *.a, *.S, *.la, *.lo, *.loT, *.so, *.exe, *.dll, *.mdb, *.com, *.csproj, *.dsp, *.dsw, *.m4, *.pc, *.pc.in, *.in.in, *.omf, *.aux, .tmp, autom4te.cache, po, aclocal, Makefile, Makefile.am, Makefile.in, CVS, SCCS, conftest, conftest., /conf[0-9]+.sh/, /conf[0-9]+.file/

MAIL FOLDER (s): spam, outbox, trash, drafts, templates

DATA SOURCES TAB: Selected:
Thunderbird
Files
Kmail
Applications
documentation
manpages
locate
monodoc
windows

NETWORK OPTIONS TAB: nothing is selected

From what I see, the settings in the GUI don't seem to match those in /etc/beagle/crawl-rules, as shown below:

/etc/beagle/crawl-rules/crawl-applications
CRAWL_ENABLED="yes"
CRAWL_INDEX_NAME="applications"
CRAWL_PATHS="/usr/share/applications /usr/local/share/applications /opt/gnome/share/applications /opt/kde3/share/applications"
CRAWL_RECURSIVE="yes"
CRAWL_DISABLE_FILTERING="no"
CRAWL_DISABLE_DIRECTORIES="yes"
CRAWL_DISABLE_ON_BATTERY="yes"
CRAWL_CACHE_TEXT="no"
CRAWL_REMOVE_DELETED_FILES="yes"

/etc/beagle/crawl-rules/crawl-documentation
CRAWL_ENABLED="yes"
CRAWL_INDEX_NAME="documentation"
CRAWL_PATHS="/usr/share/doc /usr/local/share/doc /opt/kde3/share/doc /opt/kde3/share/doc-bundle /opt/gnome/share/gnome/help /usr/share/gnome/help /usr/share/gnome/help-bundle /opt/gnome/share/gtk-doc/html /usr/share/gtk-doc/html /usr/share/gnome/html"
CRAWL_ALLOW_PATTERNS=".xml,.html,.docbook,.pdf"
CRAWL_RECURSIVE="yes"
CRAWL_DISABLE_FILTERING="no"
CRAWL_DISABLE_DIRECTORIES="yes"
CRAWL_DISABLE_ON_BATTERY="yes"
CRAWL_CACHE_TEXT="no"
CRAWL_REMOVE_DELETED_FILES="yes"

/etc/beagle/crawl-rules/crawl-executables
CRAWL_ENABLED="no"

# Use the same index as the applications index
CRAWL_INDEX_NAME="applications"
CRAWL_PATHS="/bin /sbin /usr/bin /usr/sbin /usr/local/bin/ /usr/local/sbin"

# Only index files in the given directories
CRAWL_RECURSIVE="no"

# Don't index content
CRAWL_DISABLE_FILTERING="yes"
CRAWL_DISABLE_DIRECTORIES="yes"
CRAWL_DISABLE_ON_BATTERY="yes"
CRAWL_CACHE_TEXT="no"
CRAWL_REMOVE_DELETED_FILES="yes"

/etc/beagle/crawl-rules/crawl-manpages
CRAWL_ENABLED="yes"
CRAWL_INDEX_NAME="manpages"
CRAWL_PATHS="/usr/share/man/man1 /usr/share/man/man2 /usr/share/man/man3 /usr/share/man/man4 /usr/share/man/man5 /usr/share/man/man6 /usr/share/man/man7 /usr/share/man/man8 /usr/share/man/man9 /usr/share/man/mann /usr/share/info"
CRAWL_RECURSIVE="yes"
CRAWL_DISABLE_FILTERING="no"
CRAWL_DISABLE_DIRECTORIES="yes"
CRAWL_DISABLE_ON_BATTERY="yes"
CRAWL_CACHE_TEXT="no"
CRAWL_REMOVE_DELETED_FILES="yes"

/etc/beagle/crawl-rules/crawl-monodoc
CRAWL_ENABLED="no"
CRAWL_INDEX_NAME="monodoc"
CRAWL_PATHS="/usr/lib/monodoc/sources /usr/local/lib/monodoc/sources"

# Indexing Mono.zip will take a long time. It might be a good idea to add Mono.zip to deny patterns.
CRAWL_DENY_PATTERNS=".source,.tree"
CRAWL_RECURSIVE="yes"
CRAWL_DISABLE_FILTERING="no"
CRAWL_DISABLE_DIRECTORIES="yes"
CRAWL_DISABLE_ON_BATTERY="yes"
CRAWL_CACHE_TEXT="no"
CRAWL_REMOVE_DELETED_FILES="yes"
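
The crawl-rules files are plain KEY="value" lines, so a short script can summarise which rules are enabled and which directories they cover. The following is a rough sketch and not part of Beagle; the function names are invented here, and it accepts both straight and typographic quotes because the forum software may have converted them.

```python
import glob
import os

QUOTES = '"\u201c\u201d'  # straight and typographic double quotes


def parse_crawl_rule(text):
    """Parse CRAWL_* KEY="value" lines; skip blanks, comments and other text."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip().startswith("CRAWL_"):
            settings[key.strip()] = value.strip().strip(QUOTES)
    return settings


def covered_paths(settings):
    """Return the CRAWL_PATHS of a rule, or [] if the rule is disabled."""
    if settings.get("CRAWL_ENABLED", "no").lower() != "yes":
        return []
    return settings.get("CRAWL_PATHS", "").split()


def is_covered(target, paths):
    """True if target equals, or lies below, one of the crawl paths."""
    target = os.path.normpath(target)
    for p in paths:
        p = os.path.normpath(p)
        if target == p or target.startswith(p.rstrip("/") + "/"):
            return True
    return False


def summarise(rules_dir="/etc/beagle/crawl-rules"):
    """Print each rule file with the paths it actively crawls."""
    for rule_file in sorted(glob.glob(os.path.join(rules_dir, "crawl-*"))):
        with open(rule_file, encoding="utf-8", errors="replace") as fh:
            settings = parse_crawl_rule(fh.read())
        print(rule_file, covered_paths(settings))
```

With the rules quoted above, neither /archive nor /media/ebooks appears in any CRAWL_PATHS. That would be consistent with these files describing the system-wide setup rather than the per-user GUI settings, so the mismatch by itself may not be the problem.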

suggestions?
regards and thanks for your time in this
qu1nn

Still at a loss. Here is a copy of ~/.beagle/Log. There are several files there, and I'm not sure which to pick:

CURRENT-BEAGLE:
20091226 16:12:06.8195 03836 Beagle INFO: Starting Beagle Daemon (version 0.3.9)
20091226 16:12:06.8782 03836 Beagle INFO: Running on Mono 2.4.2.3
20091226 16:12:06.9476 03836 Beagle INFO: Command Line: /usr/lib64/beagle/BeagleDaemon.exe --replace --bg
20091226 16:12:08.7228 03836 Beagle WARN: Found a dangling index lock on /home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-0b4be62ddabcdfcc83b8691eda76a2b7-write.lock.
20091226 16:12:08.7237 03836 Beagle WARN: Found a dangling index lock on /home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-a92a5ffb4516b03be63a247727e81146-write.lock.
20091226 16:12:08.7828 03836 Beagle WARN: Indexes verified in .06s. Deleting stale lock files.
20091226 16:12:08.7847 03836 Beagle WARN: Could not delete lock files.
20091226 16:12:08.7947 03836 Beagle ERROR EX: Caught exception while instantiating Files backend
20091226 16:12:08.7947 03836 Beagle ERROR EX: System.Reflection.TargetInvocationException: Exception has been thrown by the target of an invocation. ---> System.UnauthorizedAccessException: Access to the path "/home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-0b4be62ddabcdfcc83b8691eda76a2b7-write.lock" is denied.
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.File.Delete (System.String path) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.Directory.Delete (System.String path, Boolean recursive) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneCommon.Create () [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryingDriver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.LuceneNameResolver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable.BuildLuceneQueryingDriver (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable..ctor () [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at (wrapper managed-to-native) System.Reflection.MonoCMethod:InternalInvoke (object,object[],System.Exception&)
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: --- End of inner exception stack trace ---
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.ConstructorInfo.Invoke (System.Object[] parameters) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type, Boolean nonPublic) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type) [0x00000]
20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.QueryDriver.ScanAssemblyForQueryables (System.Reflection.Assembly assembly) [0x00000]
20091229 00:40:12.7061 03836 Beagle INFO: Shutdown requested
20091229 00:40:12.7426 03836 Beagle INFO: Beagle daemon process shut down cleanly.

current-BEAGLE-EXTENSIONS:
20091229 07:59:15.4060 03739 Beagle ERROR EX: Caught exception while instantiating Files backend
20091229 07:59:15.4060 03739 Beagle ERROR EX: System.Reflection.TargetInvocationException: Exception has been thrown by the target of an invocation. ---> System.UnauthorizedAccessException: Access to the path "/home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-0b4be62ddabcdfcc83b8691eda76a2b7-write.lock" is denied.
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.File.Delete (System.String path) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.Directory.Delete (System.String path, Boolean recursive) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneCommon.Create () [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryingDriver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.LuceneNameResolver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable.BuildLuceneQueryingDriver (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable..ctor () [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at (wrapper managed-to-native) System.Reflection.MonoCMethod:InternalInvoke (object,object[],System.Exception&)
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: --- End of inner exception stack trace ---
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.ConstructorInfo.Invoke (System.Object[] parameters) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type, Boolean nonPublic) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type) [0x00000]
20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.QueryDriver.ScanAssemblyForQueryables (System.Reflection.Assembly assembly) [0x00000]
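
Both excerpts fail at the same point, so it can help to pull just the denied paths out of the log files instead of reading the full traces. A minimal sketch, assuming only the message format visible in the excerpts above; the function name is made up for illustration.

```python
import re

# Matches the message seen in the excerpts, with straight or typographic quotes.
DENIED = re.compile(r'Access to the path ["\u201c](.+?)["\u201d] is denied')


def denied_paths(log_text):
    """Return the distinct paths reported as access-denied, in first-seen order."""
    seen = []
    for match in DENIED.finditer(log_text):
        path = match.group(1)
        if path not in seen:
            seen.append(path)
    return seen
```

Run over the two excerpts, this yields the same single lock file under FileSystemIndex/Locks both times, which suggests looking at permissions on that directory before suspecting the PDF or OpenOffice filters.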

I am at a loss.
regards
qu1nn

Items installed:
Need extended attributes
multiple drives
/dev/sda2 [ext4] / fstab options: extended user attributes selected
data journaling mode = ordered
Access Control Lists selected

/dev/sda3 [ext4] /home (fstab options are the same as above)
/dev/sdb1 [ext4] /archive (fstab options are the same as above)
/dev/sdc1 [fat] /media/disk

Installed Software [version]:
openSUSE 11.2 (x86_64)
KDE 4.3.1 release 6

beagle-gui [0.3.9-9.10.4]
beagle-firefox [0.3.9-9.10.4]
beagle-index [11.2_20090721-3.12]
beagle-thunderbird [0.3.9-9.10.4]
beagle-xesam [0.2-145.5]

perl-Linux-Inotify2 [1.1-131.1]
perl-Linux-Inotify2-debuginfo [1.1-131.1]
perl-Linux-Inotify2-debugsource [1.1-131.1]
python-pyinotify [0.8.1-0.pm.1.1]

libbeagle-devel [0.3.9-3.3]
libbeagle-doc [0.3.9-3.3]
libbeagle-python [0.3.9-3.3]
libbeagle-python-debuginfo [0.3.9-3.3]
libbeagle1 [0.3.9-3.3]

mono-core [2.4.2.3-2.3]
mono-data [2.4.2.4-2.3]
mono-data-sqlite [2.4.2.4-2.3]
mono-devel [2.4.2.4-2.3]
mono-web [2.4.2.4-2.3]

glib-sharp2 [2.12.9-23.6]
NDesk-Dbus [0.6.0-65.3]
gmime [2.2.23-4.4]
sqlite3 [3.6.16-2.3]
taglib-sharp [2.0.3.3-1.1.2]

anyone have any pointers?

qu1nn

Please try the following:

1. Search path [as root]

Delete all invalid path entries in /etc/beagle/crawl-rules/crawl-documentation, then add the paths you want crawled. Note: paths you have already added in the Kerry GUI are not written to /etc/beagle/crawl-rules/crawl-documentation. I think it’s a problem with access rights.

2. Old indexes [as your own user]

Delete everything in the folder /home/user/.beagle/Indexes

3. New indexes [as your own user]

Log out of your session and back in. Beagle will start to build new indexes, including your desired files.
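
Before deleting anything, it may be worth checking who owns the files under ~/.beagle/Indexes, since the log reports "access denied" on a lock file there. The sketch below lists entries not owned by a given user. It is an illustration only: the function name is invented, and whether ownership is really the cause here is an assumption.

```python
import os


def foreign_entries(root, uid=None):
    """List root and everything below it that is not owned by uid.

    uid defaults to the user running the script.
    """
    if uid is None:
        uid = os.getuid()
    found = []
    if os.lstat(root).st_uid != uid:
        found.append(root)
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            if os.lstat(path).st_uid != uid:
                found.append(path)
    return found
```

Called as your normal user with `os.path.expanduser("~/.beagle/Indexes")`, it should return an empty list. If it does not, the listed entries would probably have to be removed or re-owned as root before step 2 can succeed.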

Good luck!

albschrat