Actually `strsplit` uses grep patterns as well. (A comma is a regex metacharacter whereas a space is not; hence the need for double escaping the commas in the pattern argument. So the use of `&quot;\\s&quot;` would be more to improve readability than of necessity):

    &gt; strsplit(test_1, &quot;\\, |\\,| &quot;)  # three possibilities OR&#39;ed
    [[1]]
    [1] &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;
    
    &gt; strsplit(test_2, &quot;\\, |\\,| &quot;)
    [[1]]
    [1] &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;

Without using both `\\,` and `\\, ` (note extra space that SO does not show) you would have gotten some character(0) values. Might have been clearer if I had written:

    &gt; strsplit(test_2, &quot;\\,\\s|\\,|\\s&quot;)
    [[1]]
    [1] &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;

@Fojtasek is so right: Using character classes often simplifies the task because it creates an implicit logical OR:

    &gt; strsplit(test_2, &quot;[, ]+&quot;)
    [[1]]
    [1] &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;
    
    &gt; strsplit(test_1, &quot;[, ]+&quot;)
    [[1]]
    [1] &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;




In case you don&#39;t like regular expressions, you can call `strsplit()` multiple times:

	strsplits &lt;- function(x, splits, ...)
	{
		for (split in splits)
		{
			x &lt;- unlist(strsplit(x, split, ...))
		}
		return(x[!x == &quot;&quot;]) # Remove empty values
	}
	
	strsplits(test_1, c(&quot; &quot;, &quot;,&quot;))
	# &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;
	strsplits(test_2, c(&quot; &quot;, &quot;,&quot;))
	# &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;


**Updated** for the added example

	strsplits(test_1, c(&quot;[[:punct:]]&quot;,&quot;[[:space:]]&quot;))
	# &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;
	strsplits(test_2, c(&quot;[[:punct:]]&quot;,&quot;[[:space:]]&quot;))
	# &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;

But if you are going to use regular expressions, you might as well go with @DWin&#39;s approach:


	strsplit(test_1, &quot;[[:punct:][:space:]]+&quot;)[[1]]
	# &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;
	strsplit(test_2, &quot;[[:punct:][:space:]]+&quot;)[[1]]
	# &quot;abc&quot; &quot;def&quot; &quot;ghi&quot; &quot;klm&quot;

You could go with `strsplit(test_1, &quot;\\W&quot;)`.


    

     test_1&lt;-&quot;abc def,ghi klm&quot;
     test_2&lt;-&quot;abc, def ghi klm&quot;
     key_words &lt;- c(&quot;abc&quot;,&quot;def&quot;,&quot;ghi&quot;)
     matches &lt;- str_c(key_words, collapse =&quot;|&quot;)
     str_extract_all(test_1, matches)
     str_extract_all(test_2, matches)

Can I get a file name or its path from a `fstream` object? I looked through the methods of `fstream` and didn&#39;t find anything close to it.

Getting filename (or path) from fstream

I have a web application project.  I generated the DLL and import it in another project.  I implemented `VirtualPathProvider`. 

I followed this web site: http://support.microsoft.com/kb/910441/en-us?spid=8940&amp;sid=global, and everything works until I add another site master.

1. I added `site_export.master` and changed its Build Action to Embedded Resource.
2. I changed my page to use the new site master.
3. `GetManifestResourceStream()` returns `null` when I load `site_export.master`.
4. I call `GetManifestResourceNames()` to check if `site_export.master` exists in the DLL and it does.  It&#39;s in the list. All of the name spaces match. I didn&#39;t list the name space out here.

Why can&#39;t `GetManifestResourceStream()` load my new `site_export.master`?  It loads `site.master` just fine.  I know my DLL is loaded because I can see other files in the DLL.

Why does GetManifestResourceStream returns null while the resource name exists when calling GetManifestResourceNames?

Given a character string

    test_1&lt;-&quot;abc def,ghi klm&quot;
    test_2&lt;-&quot;abc, def ghi klm&quot;

I wish to obtain

    &quot;abc&quot;
    &quot;def&quot;
    &quot;ghi&quot;

However, using strsplit, one must know the order of the splitting values in the string, as strsplit uses the first value to do the first split, the second to do the second... and then recycles.

But this does not:

    strsplit(test_1, c(&quot;,&quot;, &quot; &quot;))
    strsplit(test_2, c(&quot; &quot;, &quot;,&quot;))

    strsplit(test_2, split=c(&quot;[:punct:]&quot;,&quot;[:space:]&quot;))[[1]]

I am looking to split the string wherever I find any of my splitting values in a single step.

R strsplit with multiple unordered split arguments?

Given a character string
<pre><code class="hljs language-arduino">test_1&#x3C;-"abc def,ghi klm"
test_2&#x3C;-"abc, def ghi klm"
</code></pre>
I wish to obtain
<pre><code class="hljs language-arduino">"abc"
"def"
"ghi"
</code></pre>
However, using strsplit, one must know the order of the splitting values in the string, as strsplit uses the first value to do the first split, the second to do the second... and then recycles.
But this does not:
<pre><code class="hljs language-lua">strsplit(test_1, c(",", " "))
strsplit(test_2, c(" ", ","))

strsplit(test_2, split=c("[:punct:]","[:space:]"))[[1]]
</code></pre>
I am looking to split the string wherever I find any of my splitting values in a single step.

Problem here is a bit obvious I think. I&#39;d like the legend placed (locked) in the top left hand corner of the &#39;plotting region&#39;. Using c(0.1,0.13) etc is not an option for a number of reasons.

Is there a way to change the reference point for the co-ordinates so they are relative to the plotting region?

    mtcars$cyl &lt;- factor(mtcars$cyl, labels=c(&quot;four&quot;,&quot;six&quot;,&quot;eight&quot;))
    ggplot(mtcars, aes(x=wt, y=mpg, colour=cyl)) + geom_point(aes(colour=cyl)) + 
    opts(legend.position = c(0, 1), title=&quot;Legend placement makes me sad&quot;)


![enter image description here][1]

Cheers


  [1]: http://i.stack.imgur.com/i4H6s.png

Legend placement, ggplot, relative to plotting region

I have a table in R that has `str()` of this:

     table [1:3, 1:4] 0.166 0.319 0.457 0.261 0.248 ...
     - attr(*, &quot;dimnames&quot;)=List of 2
      ..$ x: chr [1:3] &quot;Metro &gt;=1 million&quot; &quot;Metro &lt;1 million&quot; &quot;Non-Metro Counties&quot;
      ..$ y: chr [1:4] &quot;q1&quot; &quot;q2&quot; &quot;q3&quot; &quot;q4&quot;

And looks like this when I print it:

                        y
    x                           q1        q2        q3        q4
      Metro &gt;=1 million  0.1663567 0.2612212 0.2670441 0.3053781
      Metro &lt;1 million   0.3192857 0.2480012 0.2341030 0.1986102
      Non-Metro Counties 0.4570341 0.2044960 0.2121102 0.1263597

I want to get rid of the `x` and `y` and convert it to a data frame that looks exactly the same as the above (three rows, four columns), but without the `x` or `y`. If I use `as.data.frame(mytable)`, instead I get this:

                        x  y      Freq
    1   Metro &gt;=1 million q1 0.1663567
    2    Metro &lt;1 million q1 0.3192857
    3  Non-Metro Counties q1 0.4570341
    4   Metro &gt;=1 million q2 0.2612212
    5    Metro &lt;1 million q2 0.2480012
    6  Non-Metro Counties q2 0.2044960
    7   Metro &gt;=1 million q3 0.2670441
    8    Metro &lt;1 million q3 0.2341030
    9  Non-Metro Counties q3 0.2121102
    10  Metro &gt;=1 million q4 0.3053781
    11   Metro &lt;1 million q4 0.1986102
    12 Non-Metro Counties q4 0.1263597

I probably fundamentally do not understand how tables relate to data frames.  

How to convert a table to a data frame

I have over 700 files in one folder named as:
files from number 1 to number9 are named for the first month:
 
    water_200101_01.img  
    water_200101_09.img  

files from number 10 to number30 are named:

    water_200101_10.img
    water_200101_30.img

 And so on for the second month:
files from number 1 to number9 are named:
 
    water_200102_01.img  
    water_200102_09.img  

files from number 10 to number30 are named:

    water_200102_10.img
    water_200102_30.img 

How can I rename them without making any changes to the files. just change the nams, for example    

    water_1
    water_2
    ...till...
    water_700

How do I rename files using R?

                                                                     
                                                                     
                                                                     
                                             
Main Question
---


I&#39;m having issues with understanding why the handling of dates, labels and breaks is not working as I would have expected in R when trying to make a histogram with ggplot2.

**I&#39;m looking for:**

- A histogram of the frequency of my dates
- Tick marks centered under the matching bars
- Date labels in `%Y-b` format
- Appropriate limits; minimized empty space between edge of grid space and outermost bars

I&#39;ve [uploaded my data to pastebin](http://pastebin.com/sDzXKFxJ) to make this reproducible. I&#39;ve created several columns as I wasn&#39;t sure the best way to do this:

    &gt; dates &lt;- read.csv(&quot;http://pastebin.com/raw.php?i=sDzXKFxJ&quot;, sep=&quot;,&quot;, header=T)
    &gt; head(dates)
           YM       Date Year Month
    1 2008-Apr 2008-04-01 2008     4
    2 2009-Apr 2009-04-01 2009     4
    3 2009-Apr 2009-04-01 2009     4
    4 2009-Apr 2009-04-01 2009     4
    5 2009-Apr 2009-04-01 2009     4
    6 2009-Apr 2009-04-01 2009     4

Here&#39;s what I tried:

    library(ggplot2)
    library(scales)
    dates$converted &lt;- as.Date(dates$Date, format=&quot;%Y-%m-%d&quot;)
   
    ggplot(dates, aes(x=converted)) + geom_histogram()
    +      opts(axis.text.x = theme_text(angle=90))

Which yields [this graph](http://i.imgur.com/rks0y.png). I wanted `%Y-%b` formatting, though, so I hunted around and tried the following, based on [this SO](https://stackoverflow.com/questions/10576095/formatting-dates-with-scale-x-date-in-ggplot2):

    ggplot(dates, aes(x=converted)) + geom_histogram()
    +    scale_x_date(labels=date_format(&quot;%Y-%b&quot;),
    +    breaks = &quot;1 month&quot;)
    +    opts(axis.text.x = theme_text(angle=90))
    
    stat_bin: binwidth defaulted to range/30. Use &#39;binwidth = x&#39; to adjust this.

That gives me [this graph](http://i.stack.imgur.com/HaGsV.png)

- Correct x axis label format
- The frequency distribution has changed shape (binwidth issue?)
- Tick marks don&#39;t appear centered under bars
- The xlims have changed as well

I worked through the example in the [ggplot2 documentation](http://cran.r-project.org/web/packages/ggplot2/ggplot2.pdf) at the `scale_x_date` section and `geom_line()` appears to break, label, and center ticks correctly when I use it with my same x-axis data. I don&#39;t understand why the histogram is different.

---

Updates based on answers from edgester and gauden
---

I initially thought gauden&#39;s answer helped me solve my problem, but am now puzzled after looking more closely. Note the differences between the two answers&#39; resulting graphs after the code.

Assume for both:

    library(ggplot2)
    library(scales)
    dates &lt;- read.csv(&quot;http://pastebin.com/raw.php?i=sDzXKFxJ&quot;, sep=&quot;,&quot;, header=T)

Based on @edgester&#39;s answer below, I was able to do the following:

    freqs &lt;- aggregate(dates$Date, by=list(dates$Date), FUN=length)
    freqs$names &lt;- as.Date(freqs$Group.1, format=&quot;%Y-%m-%d&quot;)
    
    ggplot(freqs, aes(x=names, y=x)) + geom_bar(stat=&quot;identity&quot;) +
           scale_x_date(breaks=&quot;1 month&quot;, labels=date_format(&quot;%Y-%b&quot;),
                        limits=c(as.Date(&quot;2008-04-30&quot;),as.Date(&quot;2012-04-01&quot;))) +
           ylab(&quot;Frequency&quot;) + xlab(&quot;Year and Month&quot;) +
           theme_bw() + opts(axis.text.x = theme_text(angle=90))
   

Here is my attempt based on gauden&#39;s answer:

    dates$Date &lt;- as.Date(dates$Date)
    ggplot(dates, aes(x=Date)) + geom_histogram(binwidth=30, colour=&quot;white&quot;) +
           scale_x_date(labels = date_format(&quot;%Y-%b&quot;),
                        breaks = seq(min(dates$Date)-5, max(dates$Date)+5, 30),
                        limits = c(as.Date(&quot;2008-05-01&quot;), as.Date(&quot;2012-04-01&quot;))) +
           ylab(&quot;Frequency&quot;) + xlab(&quot;Year and Month&quot;) +
           theme_bw() + opts(axis.text.x = theme_text(angle=90))

Plot based on edgester&#39;s approach:

![edgester-plot][1]

Plot based on gauden&#39;s approach:

![gauden-plot][2]

Note the following:

- gaps in gauden&#39;s plot for 2009-Dec and 2010-Mar; `table(dates$Date)` reveals that there are 19 instances of `2009-12-01` and 26 instances of `2010-03-01` in the data
- edgester&#39;s plot starts at 2008-Apr and ends at 2012-May. This is correct based on a minimum value in the data of 2008-04-01 and a max date of 2012-05-01. For some reason gauden&#39;s plot starts in 2008-Mar and still somehow manages to end at 2012-May. After counting bins and reading along the month labels, for the life of me I can&#39;t figure out which plot has an extra or is missing a bin of the histogram!

Any thoughts on the differences here? edgester&#39;s method of creating a separate count

---

Related References
---

As an aside, here are other locations that have information about dates and ggplot2 for passers-by looking for help:

- [Started here](http://learnr.wordpress.com/2010/02/25/ggplot2-plotting-dates-hours-and-minutes/) at learnr.wordpress, a popular R blog. It stated that I needed to get my data into POSIXct format, which I now think is false and wasted my time.
- [Another learnr post](http://learnr.wordpress.com/2009/05/05/ggplot2-two-time-series-with-different-dates/) recreates a time series in ggplot2, but wasn&#39;t really applicable to my situation.
- [r-bloggers has a post on this](http://www.r-bloggers.com/plotting-time-series-data-using-ggplot2/), but it appears outdated. The simple `format=` option did not work for me.
- [This SO question](https://stackoverflow.com/questions/6638696/breaks-for-scale-x-date-in-ggplot2-and-r) is playing with breaks and labels. I tried treating my `Date` vector as continuous and don&#39;t think it worked so well. It looked like it was overlaying the same label text over and over so the letters looked kind of odd. The distribution is sort of correct but there are odd breaks. My attempt based on the accepted answer was like so ([result here](http://i.stack.imgur.com/ntDPD.png)).


  [1]: http://i.stack.imgur.com/SQB95.png
  [2]: http://i.stack.imgur.com/qvXN5.png

Understanding dates and plotting a histogram with ggplot2 in R

I don&#39;t know how to make a list of lists in R.
I have several lists, I want to store them in one data structure to make accessing them easier. However, it looks like you cannot use a list of lists in R, so if I get list l1 from another list, say, l2 then I cannot access elements l1. How can I implement it?

**EDIT-** I will show an example of what does not work for me:

    list1 &lt;- list()
    list1[1] = 1
    list1[2] = 2
    list2 &lt;- list()
    list2[1] = &#39;a&#39;
    list2[2] = &#39;b&#39;
    list_all &lt;- list(list1, list2)
    a = list_all[1]
    a[2]
    #[[1]]
    #NULL
but `a` should be a list!



How can I make a list of lists in R?

I have a SQL Server 2008 R2 column containing a string which I need to split by a comma. I have seen many answers on StackOverflow but none of them works in R2. I have made sure I have select permissions on any split function examples. Any help greatly appreciated.

T-SQL split string

Here is the current code in my application:

    String[] ids = str.split(&quot;/&quot;);

When profiling the application, a non-negligeable time is spent string splitting. Also, the `split` method takes a regular expression, which is superfluous here.

What alternative can I use in order to optimize the string splitting? Is `StringUtils.split` faster?

(I would&#39;ve tried and tested myself but profiling my application takes a lot of time.)

Java split String performances

I have one file with `-|` as delimiter after each section...need to create separate files for each section using unix.

example of input file

    wertretr
    ewretrtret
    1212132323
    000232
    -|
    ereteertetet
    232434234
    erewesdfsfsfs
    0234342343
    -|
    jdhg3875jdfsgfd
    sjdhfdbfjds
    347674657435
    -|

Expected result in File 1

    wertretr
    ewretrtret
    1212132323
    000232
    -|

Expected result in File 2

    ereteertetet
    232434234
    erewesdfsfsfs
    0234342343
    -|

Expected result in File 3

    jdhg3875jdfsgfd
    sjdhfdbfjds
    347674657435
    -|

Split one file into multiple files based on delimiter

I am attempting to split a list into a series of smaller lists.

**My Problem:** My function to split lists doesn&#39;t split them into lists of the correct size. It should split them into lists of size 30 but instead it splits them into lists of size 114?

How can I make my function split a list into X number of Lists of size **30 or less**?

	public static List&lt;List&lt;float[]&gt;&gt; splitList(List &lt;float[]&gt; locations, int nSize=30) 
    {		
		List&lt;List&lt;float[]&gt;&gt; list = new List&lt;List&lt;float[]&gt;&gt;();
		
		for (int i=(int)(Math.Ceiling((decimal)(locations.Count/nSize))); i&gt;=0; i--) {
			List &lt;float[]&gt; subLocat = new List &lt;float[]&gt;(locations); 
			
			if (subLocat.Count &gt;= ((i*nSize)+nSize))
				subLocat.RemoveRange(i*nSize, nSize);
			else subLocat.RemoveRange(i*nSize, subLocat.Count-(i*nSize));
			
			Debug.Log (&quot;Index: &quot;+i.ToString()+&quot;, Size: &quot;+subLocat.Count.ToString());
			list.Add (subLocat);
		}
		
		return list;
	}

If I use the function on a list of size 144 then the output is:

&gt; Index: 4, Size: 120  
&gt; Index: 3, Size: 114  
&gt; Index: 2, Size: 114  
&gt; Index: 1, Size: 114  
&gt; Index: 0, Size: 114

Split a List into smaller lists of N size

I&#39;m trying to help out a coworker who accidentally created one feature branch from another feature branch, rather than creating the second one from master. Here is essentially what we have now…

    Master ---A---B---C
                       \
                  Foo   E---F---F---H
                                     \
                                Bar   J---K---L---M

And here is what we&#39;d like to have…

    Master ---A---B---C
                      |\
                 Foo  | E---F---F---H
                      |
                 Bar  J---K---L---M

One way I thought of would be to create FooV2 and BarV2 branches, and cherry-pick the individual commits into the appropriate V2 branches. But I&#39;m curious, is there a better way to handle this situation?



Content Type	Original Author	Original Content on Stackoverflow
Question	Etienne Low-Décarie	View Question on Stackoverflow
Solution 1 - R	IRTFM	View Answer on Stackoverflow
Solution 2 - R	jthetzel	View Answer on Stackoverflow
Solution 3 - R	danas.zuokas	View Answer on Stackoverflow
Solution 4 - R	zhan2383	View Answer on Stackoverflow

R strsplit with multiple unordered split arguments?

R Problem Overview

R Solutions

Solution 1 - R

Solution 2 - R

Solution 3 - R

Solution 4 - R

Why does GetManifestResourceStream returns null while the resource name exists when calling GetManifestResourceNames?

Getting filename (or path) from fstream

Attributions